Ayms / node-htmlparser

Forgiving HTML Parser in JS for Node

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

A forgiving HTML parser written in JS for Node.
Reads in imperfect HTML and spits out a simple object model.

Usage:
	var htmlparser = require("node-htmlparser");
	var rawHtml = "Xyz <script language= javascript>var foo = '<<bar>>';< /  script><!--<!-- Waah! -- -->";
	var htmlDom = htmlparser.ParseHtml(rawHtml);
	sys.puts(sys.inspect(htmlDom, false, null));
	
	Outputs...
		[ { raw: 'Xyz ', data: 'Xyz ', type: 'text' }
		, { raw: 'script language= javascript'
		  , data: 'script language= javascript'
		  , type: 'script'
		  , name: 'script'
		  , attribs: { language: 'javascript' }
		  , children: 
		     [ { raw: 'var foo = \'<bar>\';<'
		       , data: 'var foo = \'<bar>\';<'
		       , type: 'text'
		       }
		     ]
		  }
		, { raw: '<!-- Waah! -- '
		  , data: '<!-- Waah! -- '
		  , type: 'comment'
		  }
		]

About

Forgiving HTML Parser in JS for Node

License:MIT License