Monolingual information retrieval in local language: case study in assamese

Barman, Anup Kumar; Sarma, Shikhar Kumar

Artykuł - szczegóły

Czasopismo

Rocznik Naukowy Wydziału Zarządzania w Ciechanowie

2018 | 12 | nr 1-4 | 136--142

Tytuł artykułu

Monolingual information retrieval in local language: case study in assamese

Autorzy

Anup Kumar Barman , Shikhar Kumar Sarma

Warianty tytułu

Jednojęzyczne pozyskiwanie informacji w lokalnym języku: przypadek języka assamskiego

Języki publikacji

Abstrakty

Large amount of information always implies the need of a good retrieval system. The research on Information retrieval (IR) is become very important due to the tremendous growth of digitalized information. Information retrieval system provide the most relevant information from a large collection based on the user query. For the necessity of finding relevant information the research on Information retrieval has been started from 1950. Several IR systems were implemented depending on the nature of information and users. Finding the most relevant information based on the fired query in their own language is the aim of monolingual information Retrieval. In multilingual country like India where 23 official languages exists digitalize local language contents are growing tremendously. To meet the need of each individual's relevant information the monolingual IR in own language is very essential. Here we analyze the basic requirement of developing the monolingual IR. The IR system discussed here is implemented for Assamese Language which is one of the scheduled language of India. The retrieval efficiency of a statistical IR system can be enhanced using linguistic information generated through various Natural Language Processing applications. (original abstract)

Duża ilość informacji zawsze implikuje potrzebę dobrego systemu wyszukiwania. Badania nad pozyskiwaniem informacji (IR) stają się bardzo ważne, ze względu na ogromny wzrost cyfryzacji informacji. System wyszukiwania informacji dostarcza najbardziej istotne informacje z dużej kolekcji, na podstawie zapytania użytkownika. Ze względu na konieczność znalezienia odpowiednich informacji, badania nad pozyskiwaniem informacji rozpoczęto od 1950 r. Wdrożono kilka systemów IR w zależności od charakteru informacji i użytkowników. Znalezienie najistotniejszych informacji w oparciu o zadane zapytanie w lokalnym języku jest celem pobierania informacji jednojęzycznych. W wielojęzycznym kraju, takim jak Indie, gdzie istnieje 23 języków urzędowych, cyfryzacja treści w językach lokalnych ogromnie wzrasta. Aby zaspokoić zapotrzebowanie na istotne informacje każdej osoby, jednojęzyczny wskaźnik IR w lokalnym języku jest bardzo istotny. Poniżej analizujemy podstawowy wymóg opracowania jednojęzycznego IR. Omawiany tutaj system IR jest implementowany dla języka Assamskiego, który jest jednym z zaplanowanych języków Indii. Wydajność pobierania statystycznego systemu IR można zwiększyć dzięki informacjom językowym generowanym w różnych aplikacjach przetwarzania języka naturalnego.(abstrakt oryginalny)

Słowa kluczowe

Natural languages Information Information retrieval Information science

Języki naturalne Informacja Wyszukiwanie informacji Informatyka

Czasopismo

Rocznik Naukowy Wydziału Zarządzania w Ciechanowie

Rocznik

2018

Tom

Numer

nr 1-4

Strony

136--142

Opis fizyczny

Twórcy

autor

Anup Kumar Barman

Central Institute of Technology, Kokrajhar, India

autor

Shikhar Kumar Sarma

Gauhati University, India

Bibliografia

Vannevar Bush. As We May Think. Atlantic Monthly, 176:101-108, July 1945.
Gerard Salton, editor. The SMART Retrieval System-Experiments in Automatic Document Retrieval. Prentice Hall Inc., Englewood Cliffs, NJ, 1971.
Apache Solr(2011)http://lucene.apache.org/solr/
Apache Lucene(2011)http://lucene.apache.org/
Apache Nutch(2005)http://nutch.apache.org/
Apache Heritrix(2012) https://webarchive.jira.com/wiki/display/Heritrix/Heritrix
Barman A.K, Sarmah J, Sarma S.K., Automatic Identification of Assamese and Bodo Multiword Expressions. In Proceedings of Second International Conference on Ad- vances in Computing Communications and Informatics (ICACCI2013), IEEE, Mysore, India.
Barman A.K, Sarmah J, Sarma S.K., POS Tagging of Assamese Language and Performance Analysis of CRF++ & fnTBL Approaches, In Proceedings of the 15th International Conference on Computer Modelling and Simulation, UKSim 2013, IEEE, Cambridge, UK.
Sarma S. K, Medhi R, Gogoi M, Saikia U, Foundation and Structure of Developing an Assamese Wordnet. In Proceedings of GWC 2010.
Sharma P Sarma U, Kalita J., The first Steps towards Assamese Named Entity Recognition, Brisbane Convention Center Brisbane Australia, 2010.
Sarma S. K, Bharali H, Gogoi A, Deka R, Barman A.K., A Structured Approach for building Assamese Corpus: Insights, Application and Challenges, In Proceedings of 10th Workshop on Asian Language Resource.
Ricardo Baeza-Yates and Berthier Ribeiro-Neto. Modern Information Retrieval. ACM Press, 1999.

Typ dokumentu

Bibliografia

Identyfikatory

Identyfikator YADDA

bwmeta1.element.ekon-element-000171547193

Komentarze

Musisz być zalogowany aby pisać komentarze.

Rocznik Naukowy Wydziału Zarządzania w Ciechanowie

Monolingual information retrieval in local language: case study in assamese

Zgłoszenie zostało wysłane

Zgłoszenie zostało wysłane