Enhance the data alignment results using multiple annotators

B Prasad

Abstract


The Internet presents a huge amount of useful information which is usually formatted for its users.The information which is extracted from different databases it is also called as Web database.In online shopping our required information is not present in a single position.we need deep web data collection.In this project, we present an automatic annotation approach that all records  align into a table.And similar concepts of information form as a group it is called as aggregation.For each  and every group of similar contents we assign the labels from multiple number of structures then we create final annotation.After extract the records directly we align into a annotation.No need to perform first two steps.All the user can easily understand the meaningful results.


Keywords


Data alignment, data annotation, web database, wrapper generation.

References


A. Arasu and H. Garcia-Molina, “Extracting Structured Data from Web Pages,”Proc. SIGMOD Int’l Conf. Management of Data,2003.

L. Arlotta, V. Crescenzi, G. Mecca, and P. Merialdo, “Automatic Annotation of Data Extracted from Large Web Sites,”Proc. Sixth Int’l Workshop the Web and Databases (WebDB),2003.

P. Chan and S. Stolfo, “Experiments on Multistrategy Learning by Meta-Learning,”Proc. Second Int’l Conf. Information and Knowledge Management (CIKM),1993.

W. Bruce Croft, “Combining Approaches for Information Retrieval,”Advances in Information Retrieval: Recent Research from the Center for Intelligent Information Retrieval,Kluwer Academic, 2000.

V. Crescenzi, G. Mecca, and P. Merialdo, “RoadRUNNER: Towards Automatic Data Extraction from Large Web Sites,”Proc. Very Large Data Bases (VLDB) Conf.,2001.

S. Dill et al., “SemTag and Seeker: Bootstrapping the Semantic Web via Automated Semantic Annotation,”Proc. 12th Int’l Conf. World Wide Web (WWW) Conf.,2003.

H. Elmeleegy, J. Madhavan, and A. Halevy, “Harvesting Relational Tables from Lists on the Web,” Proc. Very Large Databases (VLDB) Conf.,2009.

D. Embley, D. Campbell, Y. Jiang, S. Liddle, D. Lonsdale, Y. Ng,

and R. Smith, “Conceptual-Model-Based Data Extraction from Multiple-Record Web Pages,”Data and Knowledge Eng.,vol. 31,no. 3, pp. 227-251, 1999.

D. Freitag, “Multistrategy Learning for Information Extraction,”Proc. 15th Int’l Conf. Machine Learning (ICML),1998.

D. Goldberg,Genetic Algorithms in Search, Optimization and Machine

Learning.Addison Wesley, 198.

S. Handschuh, S. Staab, and R. Volz, “On Deep Annotation,”Proc.12th Int’l Conf. World Wide Web (WWW),2003.

S. Handschuh and S. Staab, “Authoring and Annotation of Web Pages in CREAM,”Proc. 11th Int’l Conf. World Wide Web (WWW),2003.

B. He and K. Chang, “Statistical Schema Matching Across Web Query Interfaces,”Proc. SIGMOD Int’l Conf. Management of Data,2003.

H. He, W. Meng, C. Yu, and Z. Wu, “Automatic Integration of Web Search Interfaces with

no. 3, pp. 256-273, Sept. 2004.

H. He, W. Meng, C. Yu, and Z. Wu, “Constructing Interface Schemas for Search Interfaces of Web Databases,” Proc. Web Information Systems Eng. (WISE) Conf.,2005.

J. Heflin and J. Hendler, “Searching the Web with SHOE,” Proc. AAAI Workshop,2000.


Full Text: PDF[FULL TEXT]

Refbacks

  • There are currently no refbacks.


Copyright © 2013, All rights reserved.| ijseat.com

Creative Commons License
International Journal of Science Engineering and Advance Technology is licensed under a Creative Commons Attribution 3.0 Unported License.Based on a work at IJSEat , Permissions beyond the scope of this license may be available at http://creativecommons.org/licenses/by/3.0/deed.en_GB.