Why solution is not feasible in HTML ?
HTML tag set is too limited
- to represent or differentiate between the multitude of database fields
- hard to automate the process
HTML is incapable of representing the variety of structures in those documents.
HTML lacks mechanism in checking the data for structural validity