eConn Html Parser is a simple HTML scanner and tag balancer that lets application programmers parse HTML documents and access the information through standard XML interfaces. The parser scans HTML files and fixes up many common mistakes that human and computer authors make when writing HTML documents: it adds missing parent elements, automatically closes elements with optional end tags, and handles mismatched inline element tags.
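To make the idea of tag balancing concrete, here is a minimal sketch in Python. It is not the eConn implementation: the class name TagBalancer, the function balance, and the two small tag tables are assumptions made for illustration, and the sketch only covers closing optional end tags, repairing mismatched inline tags, and normalizing attributes. It does not add missing parent elements.

```python
from html import escape
from html.parser import HTMLParser

# Assumed, deliberately incomplete tables; a real balancer needs the full HTML rules.
VOID_TAGS = {"br", "hr", "img", "input", "link", "meta"}
OPTIONAL_END_TAGS = {"li", "p"}


class TagBalancer(HTMLParser):
    """Hypothetical sketch: re-emit sloppy HTML as well-formed XML text."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.out = []    # pieces of the repaired document
        self.stack = []  # names of the elements that are currently open

    def handle_starttag(self, tag, attrs):
        # HTMLParser already lower-cases tag and attribute names.
        if tag in OPTIONAL_END_TAGS and self.stack and self.stack[-1] == tag:
            self._close_top()  # e.g. a new <li> implicitly ends the previous one
        unique = {}
        for name, value in attrs:
            # Keep the first of any duplicates; give empty attributes a value.
            unique.setdefault(name, name if value is None else value)
        attr_text = "".join(
            f' {name}="{escape(value, quote=True)}"' for name, value in unique.items()
        )
        if tag in VOID_TAGS:
            self.out.append(f"<{tag}{attr_text}/>")
        else:
            self.out.append(f"<{tag}{attr_text}>")
            self.stack.append(tag)

    def handle_endtag(self, tag):
        if tag not in self.stack:
            return  # stray end tag: ignore it
        # Close everything opened after `tag`, then `tag` itself.
        while self._close_top() != tag:
            pass

    def handle_data(self, data):
        self.out.append(escape(data, quote=False))

    def _close_top(self):
        tag = self.stack.pop()
        self.out.append(f"</{tag}>")
        return tag

    def result(self):
        self.close()  # flush any buffered text
        while self.stack:
            self._close_top()  # close whatever the author left open
        return "".join(self.out)


def balance(html_text):
    balancer = TagBalancer()
    balancer.feed(html_text)
    return balancer.result()


print(balance("<UL><li>one<li>two <b>bold</UL>"))
```

The repaired text can then be handed to any XML parser. The stack-based design is a common way to balance tags, but a production parser would also need the HTML content model to decide where missing parents belong.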
Most data on the Web is stored in the Hypertext Markup Language (HTML) format, and there are many occasions when you might want to parse HTML in a C# application. However, the .NET Framework does not provide an easy way to do so, as shown by the numerous questions posted by C# programmers looking for an easy way to parse HTML.
The Microsoft .NET Framework includes extensive support for Extensible Markup Language (XML). However, although XML and HTML look very similar, an XML parser will reject much of the HTML found on the Web. Consider the following major differences between XML and HTML:
XML requires end tags.
All XML attribute values must be fully quoted with either single or double quotes.
XML tags must be properly nested.
XML tag names are case sensitive.
XML does not allow duplicate attributes.
Empty attributes are not allowed in XML.
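The rules above can be checked with any strict XML parser. The following sketch uses Python's standard xml.etree.ElementTree module rather than the .NET XML classes, purely for illustration; the fragments and the helper name is_well_formed_xml are made up for this example. Each fragment is acceptable to typical HTML browsers but breaks one of the listed XML rules.

```python
import xml.etree.ElementTree as ET

# One illustrative fragment per rule in the list above.
VIOLATIONS = {
    "missing end tag": "<div><p>text</div>",
    "unquoted attribute value": "<a href=page.html>link</a>",
    "improper nesting": "<b><i>text</b></i>",
    "mismatched tag case": "<P>text</p>",
    "duplicate attribute": '<a id="x" id="y">link</a>',
    "empty attribute": "<input disabled />",
}


def is_well_formed_xml(fragment):
    """Return True if a strict XML parser accepts the fragment."""
    try:
        ET.fromstring(fragment)
    except ET.ParseError:
        return False
    return True


for rule, fragment in VIOLATIONS.items():
    print(f"{rule}: well-formed = {is_well_formed_xml(fragment)}")
```

Every fragment is rejected, which is why HTML usually has to be repaired before it can be loaded through XML interfaces.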
Features
Browse password-protected sites:
Our technology can browse any password-protected website, given a login and password.
Browse all internal pages:
The system can browse all internal pages of a site and extract the data.
Extract data by regular expression:
The system can extract data based on regular expressions. For example, our Resume Parser extracts Name, City, Zip, etc. from a generic resume.
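As a rough illustration of regular-expression extraction, the sketch below pulls a name, city, and ZIP code out of a small resume-like text. The patterns, the sample text, and the function name extract_fields are assumptions for this example and are not the product's actual rules; real resumes vary far more than this.

```python
import re

# Hypothetical patterns for a resume that labels its fields line by line.
FIELD_PATTERNS = {
    "name": re.compile(r"^Name:\s*(.+)$", re.MULTILINE),
    "city": re.compile(r"^City:\s*(.+)$", re.MULTILINE),
    "zip": re.compile(r"\b(\d{5})(?:-\d{4})?\b"),  # US ZIP, optional +4 suffix
}

SAMPLE_RESUME = """Name: Jane Doe
City: Springfield
Zip: 62704
Experience: Software developer
"""


def extract_fields(text):
    """Return the first match for each field, or None when a field is absent."""
    result = {}
    for field, pattern in FIELD_PATTERNS.items():
        match = pattern.search(text)
        result[field] = match.group(1).strip() if match else None
    return result


print(extract_fields(SAMPLE_RESUME))
```

Keeping the patterns in a dictionary makes it easy to add or adjust fields without touching the extraction loop.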
Export data into various formats:
Exports data to CSV, XLS, XML, SQL database, MS Access, or ODBC.
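For the CSV case, an export step might look like the following sketch, which uses Python's standard csv module. The function name to_csv and the sample record are hypothetical, and the other formats listed above are not covered here.

```python
import csv
import io


def to_csv(records, fieldnames):
    """Serialize a list of dictionaries to CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()


records = [{"name": "Jane Doe", "city": "Springfield, IL", "zip": "62704"}]
print(to_csv(records, ["name", "city", "zip"]))
```

The csv module quotes values that contain commas, so a city such as "Springfield, IL" stays in a single column.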
Update / insert / delete local data:
The system can update, insert, or delete records in a local database based on changes in the target website.
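One simple way to keep a local table in step with freshly extracted data is to compare keys and values and apply the difference. The sketch below does this with Python's standard sqlite3 module; the table name items, its two columns, and the function sync_rows are assumptions for illustration, not the product's schema or method.

```python
import sqlite3


def sync_rows(conn, scraped):
    """Make the local `items` table match `scraped`, a dict of key -> value."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS items (key TEXT PRIMARY KEY, value TEXT)"
    )
    local = dict(conn.execute("SELECT key, value FROM items"))

    # Work out which rows are new, changed, or gone from the target site.
    inserted = [k for k in scraped if k not in local]
    updated = [k for k in scraped if k in local and local[k] != scraped[k]]
    deleted = [k for k in local if k not in scraped]

    conn.executemany(
        "INSERT INTO items (key, value) VALUES (?, ?)",
        [(k, scraped[k]) for k in inserted],
    )
    conn.executemany(
        "UPDATE items SET value = ? WHERE key = ?",
        [(scraped[k], k) for k in updated],
    )
    conn.executemany("DELETE FROM items WHERE key = ?", [(k,) for k in deleted])
    conn.commit()
    return {
        "inserted": sorted(inserted),
        "updated": sorted(updated),
        "deleted": sorted(deleted),
    }


conn = sqlite3.connect(":memory:")
print(sync_rows(conn, {"a": "1", "b": "2"}))
print(sync_rows(conn, {"a": "1", "b": "3", "c": "4"}))
```

Loading the whole table into memory keeps the sketch short; a large data set would call for comparing rows in batches or inside the database.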
Trusted Technology:
Written in Microsoft C# .NET 2.0, a trusted and robust technology.