Bellfalasch / Migrate-2-WordPress-2.0

Migrate static websites to Wordpress by crawling/scraping the pages, saving them to a database, and then stripping and improving their html code before presenting you with an exported XML-file that you can import into WordPress.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Migrate 2 WordPress, 2.0 - BETA

Migrate your static files to Wordpress

This repo is loosely based on "Bobby CMS", which is not in development but works for the needs of this project (it's basically just a simple CRUD-system to generate forms for the database, with validation).

The main function of Migrate 2 WordPress (or M2WP for short) is to first crawl/scrape an old website (perhaps built with static html-files), then clean up / tidy the html code, and end it all by handing you a downloadable XML-file that can be imported straight into Wordpress.

This project is currently in open beta, until we'll reach the "Beta 1.0"-version. Check the CHANGELOG.md for more details.

It's highly recommended to use proper backups of your database before using this code, and expect big changes between each beta release, and no upgrade models.

Disclaimer

This project is not made to be working on every setup out there. I made it to assist myself in porting some old code from about 10 different sites. Don't expect it to automagically work on every kind of weird old setup. It's tested to work on WordPress 3.6 to 4.9, but doesn't take into account that WordPress export format might change in the future.

Currently it cannot handle URL's based on "folder"-style URLs (mysite/mypage/), as it expects a file ending to validate a file as crawlable. It doesn't handle JavaScript (or Ajax) at all, or Flash.

Also, don't expect it to produce perfect result from old code. It will do the best it can. You won't get away from having to manually editing some pages in the end anyway, but the amount of work is greatly reduced.

M2WP doesn't create any Posts, Menus, Images, or other "complex" things inside Wordpress. The export file you'll get in the end will only generate the page structure with its contents.

It won't support content spread into many different "blocks" / areas on a single page. It only supports one starting point, and then one ending point. Everything between them will be counted as content.

It's also currently based on a somewhat slow fopen-crawl, this might be changed in the future. We had no clue about Curl when we started the project.

Installation:

Look at the list of dependencies after this section. Make sure all is set up on the server. Open phpMyAdmin (or similiar), create a database called "m2wp" (or similar) and execute everything in the included file "/DATABASE.sql". Now upload all the files to your server (or localhost). It should work in any folder structure.

Log in with "admin@example.com" and "password" in the login form (you should be redirected automatically when you open the projects root folder). Be sure to change this password on public hosting.

  • Fork project
  • Create MySQL database
  • Run DATABASE.sql in it
  • Rename config-example.php to config.php
  • Edit config.php with your database details
  • Upload files to server
  • Login with admin@example.com and password
  • Change password

Dependencies:

This projects needs the following to run (and is tested on):

PHP

  • Version 4.3.10
  • Settings: short open tags = true
  • Settings: allow url fopen = true
  • Extensions: php_mysqli = ON
  • Extensions: php_tidy = ON

MySQL

  • Version 5.5.20
  • InnoDB used as engine

Bundled

These bundles are in use (among others), all included in the project itself:

I use Bootstrap 2.x mainly because I started the project when that was the latest and greatest, and I have not changed because I don't like the new flat style of Boostrap 3.

Basic file structure and form-generation:

Check out the readme for Bobby CMS if you need more information on basic functions, structure, etc that this project uses. All forms are built using that project too. It has a way of setting up forms easily by defining arrays of options. Some code and boom - form for editing, adding, and validation is generated. It's far from finished, but at least it works good enough to be used here =)

About

Migrate static websites to Wordpress by crawling/scraping the pages, saving them to a database, and then stripping and improving their html code before presenting you with an exported XML-file that you can import into WordPress.


Languages

Language:PHP 63.8%Language:JavaScript 20.5%Language:CSS 15.7%