University of Warsaw » Faculty of Modern Languages » Institute for French Studies

Courses Descriptions

INTRODUCTION TO COMPUTATIONAL LINGUISTICS

Places available: 12

Course delivered in: French

The course is focused on NLP(Natural Language Processing) (TAL, in French)  .

It begins with an introduction to the Linux system; special attention is given to its architecture and to the commands:  ls, more, less, cat, cp, mv, rm, find, locate, cd, mkdir, rmdir, pwd, clear, df, gzip, kill, diff, ln, man, mount, ps, tar, umount, wc.

 Follows a discussion of selected system utilities and freeware tools useful in language processing, especially in lexicography:  sed, awk, vi, grep, tail, head, sort, uniq.

After the course on Linux, a test will take place in order to check students' ability.

Credit: for the Ist term - a positive evaluation of the test.

During the second term, the Unix / NooJ are presented, which are used in treatment of large text corpuses. The students shall learn their architecture (especially lexical modules and local grammars) and the kinds of problems the two systems can solve

Credit: for the IInd term - preparation of one local grammar (in the morphological or syntactic module) and a fragmetary lexicon on the basis of a short demonstration text.  

 In parallel students are informed of the developments in NLP / TAL.

 

Bibliography:

Derwojedowa, M., Rudolf, M., Świdziński, M.: "Dehomonimizacja i desynkretyzacja w procesie automatycznego przetwarzania wielkich korpusów tekstów polskich". [W:] Biuletyn Polskiego Towarzystwa Językoznawczego LVIII, Warszawa 2002. 187-199.

Jurafsky D., Martin J. (2000). Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition. Upper Saddle River: Prentice Hall.

Kennedy, G. (1998). An Introduction to Corpus Linguistics. London: Addison Wesley Longman.

Laver, J. and Dry, H. (1998). Using Computers in Linguistics. London: Routlege.

Frédéric Pascal - Petit Manuel d'utilisation de LINUX, Université Paris-Sud

version 04.2 : septembre 2004

Mitkov, R. (2003). The Oxford Handbook of Computational Linguistics. Oxford: OUP.

Sinclair, J. (1991). Corpus, Concordance, Collocation. Oxford: OUP.

http://www.math.u-psud.fr/~pascal/linux4-1/linux4-1.pdf

http://www.nothing2hide.net/fr/unixlinux/ubuntu-pocket-guide-manuale-di-linux-ubuntu-gratis/



Courses details:
The programme, including the description of particular courses, is the same both for full-time studies and for part-time studies.
Show all
Show only this unit: Department for Methodology of French Teaching
Department for Romance Linguistics
Department for French Literature
Show year: I, II, III, IV, V
Courses: EKO, FIL, GK, GO, GO-FON, HF, HLF, JH, LEKT-DJ, LIC, ŁAC, MAG, MBJ, MBL, MET, MONO, NPJ, NPL, PED, PNJF, PNJF-ATL, PNJF-LABO, PP, PROS, PSY, PU, SOC, TEM, TLD, TnF, TnP, TP, UF, WF, WJ, WL, WoKF, ZHJ