Wybrane aspekty integracji informacji z głębokiego Internetu

Flejter, Dominik; Kaczmarek, Tomasz

Article details

Journal

Zeszyty Naukowe / Akademia Ekonomiczna w Poznaniu

2007 | no. 96 | pp. 97-110

Article title

Wybrane aspekty integracji informacji z głębokiego Internetu

Authors

Dominik Flejter, Tomasz Kaczmarek

Title variants

Selected Aspects of Information Integration From Deep Web

Abstracts

The Deep Web is also referred to as the hidden Web or the invisible Web. Its sources nonetheless remain accessible to people, who usually have the domain knowledge and cognitive abilities needed to use their user interfaces. The article presents data sources in the Deep Web and discusses the integration of data originating from the Deep Web. It also addresses the problems of navigating the sources and extracting data from them.

Deep Web, understood as Web interfaces to databases, is a vast and largely unexplored source of information nowadays. Lack of automated methods to access these databases was one of the impediments to wider adoption of Deep Web data for decision support. Information about products, enterprises and organizations is dispersed and their aggregation is labour intensive, requires human expertise and therefore is costly. Hence the need for methods and tools capable of automated retrieval and data integration. In the article we discuss selected aspects of identification of Deep Web data sources, preparation to automated data retrieval and information integration. The concepts and results presented facilitate the use of Deep Web resources, thus giving decision makers better information about enterprises and their environment. (original abstract)

Keywords

Internet, Databases, Information source

Year

2007

Issue

no. 96

Pages

97-110

Bibliography

Akerlof G., The Market for Lemons: Quality Uncertainty and the Market Mechanism, Quarterly Journal of Economics, vol. 84, 1970, pp. 488-500.
Chang K.C., He B., Li C., Patel M., Zhang Z., Structured Databases on the Web: Observations and Implications, SIGMOD Rec. 2004, 33(3), pp. 61-70.
Flejter D., Hryniewiecki R., Bottom-up Discovery of Clusters of Maximal Ranges in HTML Trees for Search Engines Results Extraction, in: W. Abramowicz (ed.), Proceedings of 10th International Conference on Business Information Systems, LNCS 4439, 2007.
Grossman S.J., Stiglitz J.E., On the Impossibility of Informationally Efficient Markets, National Bureau of Economic Research, Inc., 1980.
He B., Zhang Z., Chang K.C., Knocking the Door to the Deep Web: Integrating Web Query Interfaces, Proceedings of the 2004 ACM SIGMOD Conference (SIGMOD 2004).
He H., Meng W., Yu C., Wu Z., WISE-Integrator. A System for Extracting and Integrating Complex Web Search Interfaces of the Deep Web, Proceedings of 31st International Conference on Very Large Data Bases, 2005.
Kabra G., Li C., Chang K.C., Query Routing: Finding Ways in the Maze of the Deep Web, Proceedings of the ICDE International Workshop on Challenges in Web Information Retrieval and Integration (ICDE-WIRI2005).
Kartchner C., Content Management Systems: Getting from Concept to Reality, The Journal of Electronic Publishing, vol. 3, 1998.
Kowalkiewicz M., Ekstrakcja i agregacja informacji dla potrzeb podmiotów gospodarczych, Akademia Ekonomiczna w Poznaniu, Poznań 2006.
Orlowska M., Kowalkiewicz M., Kaczmarek T., Abramowicz W., Towards More Personalized Web: Extraction and Integration of Dynamic Content from the Web, Springer 2006.
Raghavan S., Garcia-Molina H., Crawling the Hidden Web, Proceedings of 27th International Conference on Very Large Data Bases, 2001.
Sipser M., Introduction to the Theory of Computation, PWS Publishing Company, 1997.
HTML 4.01 Specification, http://www.w3.org/TR/html4/, retrieved December 2005.

Document type

Bibliography

Identifiers

YADDA identifier

bwmeta1.element.ekon-element-000152383899
