Publikacje

Streszczenie:

W tej pracy przedstawiono próbę stworzenia aplikacji umożliwiającej swobodniejszą nawigację użytkownika wśród zasobów Internetu za pomocą poleceń mowy, klasyfikację oraz uporządkowanie przeglądanej informacji. Aplikacja posiada dwa zasadnicze moduły, przy pomocy których możliwe jest przeglądanie informacji w Internecie. Pierwszy moduł nawigacji, przetwarza strony internetowe, wyodrębnia z nich elementy nawigacyjne takie jak odnośniki do innych stron, oraz nadaje elementom identyfikacyjną nazwę, dzięki której użytkownik może wydawać słowne polecenia. Strona internetowa wyświetlona zostaje użytkownikowi w niemalże oryginalnej postaci. Drugi moduł również przetwarza strony internetowe, wyodrębniając z nich elementy nawigacyjne. Jedyną różnicą w działaniu obu modułów jest sposób przetwarzania strony i ostatecznej jej reprezentacji. Drugi moduł wyodrębnia z elementów słownictwo, dzięki któremu możemy sklasyfikować informację znajdującą się na stronie, uzyskując i wyświetlając w ten sposób uporządkowany zbiór elementów nawigacyjnych. Aplikacja zaimplementowana została w języku Java z wykorzystaniem oprogramowania Oracle. W przypadku systemu rozpoznawania mowy zastosowano narzędzie Sphinx-4.

Abstract:

The paper presents an attempt to create an application enabling the user to surf much easier the resources of the Internet with the help of voice commands, as well as to classify and arrange the browsed information. The application has two basic modules which enable browsing the information on the Internet. The first navigation module processes websites, isolates navigation elements , such as links to other websites, from them and gives an identification name to the elements, which enables the user to pronounce voice commands. The website is presented to the user in a practically original form. The second module also processes websites, isolating navigation elements from them. The only difference in operation of the both modules is the mode of processing the website and its final presentation. The second module isolates from the elements vocabulary, which makes it possible to classify the information included in the website, this way acquiring and displaying, an ordered set of navigation elements. The application was implemented in Java language with the use of Oracle software. For the system of recognition and understanding of speech the Sphinx 4 tool was used.

B I B L I O G R A F I A[1] Willie Walker, Paul Lamere, Philip Kwok, Bhiksha Raj, Rita Singh, Evandro Gouvea, Peter Wolf, Joe Woelfel: Sphinx-4: A Flexible Open Source Framework for Speech Recognition, SMLI TR-2004-139 Sun Microsystems Inc., November 2004.
[2] Skubis T., Dulas J.: Parametryzacja sygnału stochastycznego za pomocą siatek dwuwymiarowych, Pomiary Automatyka Kontrola nr 7/8, 2002.
[3] A. Abdollahzadeh Barfourosh, H. R. Motahary Nezhad, M. L. Anderson, D. Perlis: Information Retrieval on the WWW and Active Logic: A Survey and Problem Definition, 2002.
[4] Michel Genereux, Alexandra Klein and Harald Trost: A Multimodal Speech Interface for Accessing Web Pages, Conference TALN 2000, Lausanne, 16-18 October 2000.
[5] Shairaj Shaik, Raymond Corvin, Rajesh Sudarsan, Faizan Javed, Qasim Ijaz, Suman Roychoudhury, Jeff Gray, Barrett Bryant: Speech-Clipse - An Eclipse Speech Plug-in, Eclipse Technology eXchange Workshop (OOPSLA), Anaheim, CA, October 2003.
[6] Basztura Czesław: Komputerowe systemy diagnostyki akustycznej, Wydawnictwo Naukowe PWN, Warszawa 1996.
[7] The Source for Java Technology Collaboration, http://java.net/
[8] Pascale Fung, Cheung Chi Shun, Lam Kwok Leung, Liu Wai Kat and Lo Yuen Yee: Salsa Version 1.0: A Speech-based Web Browser for Hong Kong English, 5th International Conference on Spoken Language Processing (ICSLP 98), Sydney, Australia, 1998.
[9] K. Huang and J. Picone: Internet-Accessible Speech Recognition Technology, presented at the IEEE Midwest Symposium on Circuits and Systems, Tulsa, Oklahoma, USA, August 2002.
[10] Dulas J.: Zastosowanie metody siatek o zmiennych parametrach do identyfikacji fonemów mowy polskiej, XL VIII Otwarte Seminarium z Akustyki, 2001.

POWRÓT

Strona główna > Publikacje

Speech command based application enabling Internet navigation