February 24, 2023 We have a new paper in IJCL
February 20, 2023 Termout is finally here!
February 16, 2023 Termout is almost there...
February 10, 2023 Irene Renau gives a talk in Santiago de Compostela
January 19, 2023 A new version of Termout is coming soon...
January 16, 2023 Irene Renau is awarded a Fondecyt project grant
January 12, 2023 We wrap up a week of thesis defenses
January 9, 2023 Our students defend their Master's theses
January 6, 2023 Two of our collaborators are awarded research grants
Tools & demos
We have implemented different types of applications, and most of them can be tested online. Take a look.
+ Compare: a simple script to compare two lists of words
+ Cryptoman: a script to generate cryptograms
+ Dismark: a multilingual taxonomy of discourse markers
+ Dsele: a model dictionary for ELE learners
+ Estilector: computer-assisted writing for Spanish
+ GeNom: a program to detect the gender of proper nouns
+ HAT: a project for the treatment of polysemy in lexical taxonomies
+ Jaguar: a tool for statistical corpus analysis
+ Kind: a lexical taxonomy induction algorithm
+ Kwico: a concordancer for big corpora
+ Lealem: a reading pacer for parallel German-Spanish texts
+ Leafran: a reading pacer for parallel French-Spanish texts
+ Linguini: a language detector
+ Neven: a program to detect eventive nouns
+ POL: named entity recognition and classification
+ Poppins: a supervised text classifier
+ Porcus: an interface for various taggers and parsers for Spanish
+ pullPOS: a project for the detection of plurals in Spanish
+ Randall: a list randomizer
+ Readeutsch: a reading pacer for parallel German-English texts
+ Sapo: a program to detect similarities between documents
+ Sicam: a program to analyze Spanish poetry
+ Termout: a terminology extraction system (new version!)
+ TEXT·A·GRAM: a program to analyze Spanish texts
+ Verbario: corpus pattern analysis in Spanish
This is the view from where we are located, by the Sausalito lagoon, a quiet and lovely place in Viña del Mar, Chile. Sunny days. Birds can be seen in the center of the lagoon. As researchers, we are currently affiliated with:
Av. El Bosque 1290, Viña del Mar, Chile
Upcoming Events
March 31, 2023: We will be presenting a new project together with the journal Perspectiva Educacional, published by the Escuela de Pedagogía of the Pontificia Universidad Católica de Valparaíso. It is a very interesting project on terminology and information extraction using Termout.org and other tools developed specifically for that project. More details to come!
August 30 and 31 and September 1, 2023: Irene Renau and Rogelio Nazar will be teaching a course/workshop titled Procesamiento de corpus para lexicografía y terminología (Corpus processing for lexicography and terminology) at the Facultad de Filosofía y Letras of the Universidad Nacional de Cuyo (Mendoza, Argentina). This will take place as part of the Jornadas de Estudios Lingüísticos (JELing) 2023.
Latest ideas & research projects
We are developing new projects in computational linguistics and natural language processing:
Recent publications
+ Robledo, H.; Nazar, R. (2023). A proposal for the inductive categorisation of parenthetical discourse markers in Spanish using parallel corpora. International Journal of Corpus Linguistics. http://doi.org/10.1075/ijcl.20017.rob
+ Renau, I.; Nazar, R. (2022). Towards a multilingual dictionary of discourse markers: automatic extraction of units from parallel corpus. In: Klosa-Kückelhaus, A.; Engelberg, S.; Möhrs, C.; Storjohann, P. Dictionaries and Society. Proceedings of the XX EURALEX International Congress, Mannheim: IDS-Verlag, pp. 262-272.
+ Nazar, R.; Lindemann, D. (2022). Terminology extraction using co-occurrence patterns as predictors of semantic relevance. Proceedings of the TERM21 Workshop. Language Resources and Evaluation Conference (LREC 2022), Marseille, 20-25 June 2022, pp. 26-29.
Solutions for text processing
It is critical for organizations to be able to process information automatically, and very often that information is contained in documents written to be read by humans rather than machines. We offer different methods for text processing depending on the goal. We can teach people how to automate their text processing routines. We can batch-process thousands of documents to extract information from them or to derive different types of statistics. We can also modify these documents, or generate databases or email correspondence based on information extracted from them. Anything that involves intelligent management of information can benefit from some degree of automation, which frees up time, effort and resources. Tell us what your needs are and we will show you what we can do about them.