Tecling
Technologies for Linguistic Analysis

27 May, 2023
Benjamín López-Hidalgo presents his work on semantic neology


Yesterday Benjamín López-Hidalgo presented his work titled 'Neologismos semánticos nominales de la pandemia por covid-19: análisis contrastivo de patrones de uso en un corpus diacrónico de prensa chilena' (Nominal semantic neologisms of the covid-19 pandemic: contrastive analysis of usage patterns in a diachronic corpus of Chilean press) at the SOCHIL 2023 Conference (Sociedad Chilena de Lingüística), at the Universidad Católica Silva Henríquez. This research derives from his master's thesis at the Pontificia Universidad Católica de Valparaíso.


26 May, 2023
Hernán Robledo presented his work in Paris


Our colleague Hernán Robledo presented his work today at the Discourse Markers-Theories and Methods Conference, which is now taking place at the Université Paris Cité (Paris, France). The title of the talk is 'Discourse markers variants: a corpus study of Linguistics research article abstracts in Spanish', which is part of his post-doctoral research (Fondecyt Postdoctorado 3230617).


24 May, 2023
Yesterday we presented our work at SOCHIL 2023


We were in Santiago, Chile, yesterday to present our work at the SOCHIL 2023 Conference (Sociedad Chilena de Lingüística), held this time at the Universidad Católica Silva Henríquez. This is collaborative research by Rogelio Nazar, Irene Renau, Nicolás Acosta and Hernán Robledo, aimed at improving estilector.com. Specifically, we are developing tools to improve the use of discourse markers in academic writing.


9 May [Updated: 17 May 2023]
We presented a study on the punctuation of discourse markers


We travelled to General Roca, Argentina, to take part in the Conference of the Sociedad Argentina de Estudios Lingüísticos (SAEL 2023), held from 10 to 13 May and organized by the Facultad de Lenguas of the Universidad Nacional del Comahue. We presented a new module that we will add to Estilector.com, consisting of a study of punctuation patterns of discourse markers in Spanish.
In this context we have developed a new prototype, Punkt (http://www.tecling.com/punkt), which offers the most common punctuation forms according to a statistical corpus study. The system can also determine whether or not a discourse marker in a given text follows a conventional pattern. We will expand on this topic in the coming days.
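To illustrate the general idea (this is not Punkt's actual code), here is a minimal sketch in Python. It counts the punctuation found immediately before and after a discourse marker in a small list of sentences, and then checks whether a new sentence uses the most frequent pattern. The function names, the set of punctuation marks considered and the notion of "conventional" as "most frequent pattern" are all assumptions made for this example.

```python
import re
from collections import Counter


def punctuation_patterns(marker, sentences):
    """Count (before, after) punctuation pairs around each occurrence of `marker`.

    Hypothetical helper: only the marks , ; : . are considered, and an empty
    string means that no punctuation mark was found on that side.
    """
    regex = re.compile(
        r"([,;:.]?)\s*\b" + re.escape(marker) + r"\b\s*([,;:.]?)",
        re.IGNORECASE,
    )
    counts = Counter()
    for sentence in sentences:
        for match in regex.finditer(sentence):
            counts[(match.group(1), match.group(2))] += 1
    return counts


def follows_convention(marker, sentence, counts):
    """Return True if `sentence` uses the most frequent pattern in `counts`.

    Assumption: "conventional" simply means "the single most frequent pattern".
    """
    observed = punctuation_patterns(marker, [sentence])
    if not observed or not counts:
        return False
    most_common_pattern = counts.most_common(1)[0][0]
    return most_common_pattern in observed


# Toy corpus, invented for this example.
corpus = [
    "El texto, sin embargo, es claro.",
    "La idea, sin embargo, no convence.",
    "Sin embargo, el autor insiste.",
    "Es claro sin embargo que falla.",
]
counts = punctuation_patterns("sin embargo", corpus)
print(counts.most_common())
print(follows_convention("sin embargo", "El plan, sin embargo, falló.", counts))
```

A real system would need a much larger corpus and would probably distinguish sentence-initial from sentence-medial positions, which this sketch does not do.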


28 April 2023 [UPDATED: 17 May 2023]
We are at a few conferences this year


Somehow we managed to get ourselves into a lot of conferences this year. Maybe a few too many? Well, the list might just get even longer.

  1. 10-13 May 2023: Rogelio Nazar, Irene Renau and Nicolás Acosta will be presenting a paper at SAEL 2023 (Sociedad Argentina de Estudios Lingüísticos), at Facultad de Lenguas, Universidad Nacional del Comahue (General Roca, Argentina).
  2. 23-25 May 2023: Rogelio Nazar, Irene Renau, Nicolás Acosta and Hernán Robledo will be presenting a paper at SOCHIL 2023 (Sociedad Chilena de Lingüística), at Universidad Católica Silva Henríquez (Santiago, Chile).
  3. 1-2 June 2023: Rogelio Nazar will be presenting a paper at TOTH 2023, to be held at the University Savoie Mont-Blanc (France).
  4. 1 June 2023, 16:00 Argentina time (15:00 Chile time), online: Rogelio Nazar, Irene Renau and Nicolás Acosta will give a presentation at the III Jornadas de Corrección de Textos en Español, organized by the Universidad del Salvador (Buenos Aires, Argentina).
  5. 17-21 July 2023: Irene Renau and Rogelio Nazar will be presenting two papers at MKR 2023 (the 9th International Conference on Meaning and Knowledge Representation), at Pontificia Universidad Católica de Chile (Santiago, Chile).
  6. 30-31 August - 1 September 2023: Irene Renau and Rogelio Nazar will be offering a workshop entitled Corpus Processing for lexicography and terminology, co-located with JELing 2023 (Jornadas de Estudios Lingüísticos), at Facultad de Filosofía y Letras de la Universidad Nacional de Cuyo (Mendoza, Argentina).
  7. 27 September - 1 October 2023: Irene Renau and Rogelio Nazar will be presenting two papers at CILH 2023 (Congreso Internacional de Lingüística Hispánica), at Universität Leipzig (Germany).


6 April 2023
Nicolás Acosta ranked number 2 at UNCuyo


Nicolás Acosta, a member of Tecling and currently a student of Filosofía y Letras at the Universidad Nacional de Cuyo (Argentina), has just obtained second place across the whole University for his score in the 2022 call of the EVC-CIN Scholarship Program (Programa de Becas EVC-CIN) of the Secretaría de Investigación, Internacionales y Posgrado.
Not bad!




February 24, 2023
We have a new paper in IJCL


We have a new paper published online in the International Journal of Corpus Linguistics with the title “A proposal for the inductive categorisation of parenthetical discourse markers in Spanish using parallel corpora”, by Hernán Robledo and Rogelio Nazar. It is now available (behind a paywall) at http://doi.org/10.1075/ijcl.20017.rob



February 20, 2023
Termout is finally here!


We are very happy to announce the official opening of the new version of http://www.Termout.org, our term extraction system. With this program, users will be able to process specialized corpora, extract terms, semantic categories, definitions, equivalences, synonyms and more.
Enjoy in moderation!



Tools & demos

We have implemented different types of applications and most of them can be tested online. Take a look.

+ Compare: a simple script to compare two lists of words

+ Cryptoman: a script to generate cryptograms

+ Dismark: a multilingual taxonomy of discourse markers

+ Dsele: a model dictionary for ELE learners

+ Estilector: computer assisted writing for Spanish

+ GeNom: a program to detect the gender of proper nouns

+ HAT: a project for the treatment of polysemy in lexical taxonomies

+ Jaguar: a tool for statistical corpus analysis

+ Kind: a lexical taxonomy induction algorithm

+ Kwico: a concordancer for big corpora

+ Lealem: a reading pacer for parallel German-Spanish texts

+ Leafran: a reading pacer for parallel French-Spanish texts

+ Linguini: a language detector

+ Neven: a program to detect eventive nouns

+ POL: named entity recognition and classification

+ Poppins: a supervised text classifier

+ Porcus: an interface for various taggers and parsers for Spanish

+ pullPOS: a project for the detection of plurals in Spanish

+ Punkt: punctuation of discourse markers in Spanish (new!)

+ Randall: a list randomizer

+ Readeutsch: a reading pacer for parallel German-English texts

+ Sapo: a program to detect similarities between documents

+ Sicam: a program to analyze Spanish poetry

+ Termout: a terminology extraction system (new version!)

+ TEXT·A·GRAM: a program to analyze Spanish texts

+ Verbario: corpus pattern analysis in Spanish

Sausalito

This is the view from where we are located, in the Sausalito lagoon, a quiet and lovely place in Viña del Mar, Chile. Sunny days. Birds can be seen in the center of the lagoon.

As researchers, we are currently affiliated with:
Pontificia Universidad Católica de Valparaíso
Instituto de Literatura y Ciencias del Lenguaje

Av. El Bosque 1290, Viña del Mar, Chile

Upcoming Events
[UPDATED: 17 May 2023]

End of May 2023: We will be presenting the results of a new project with Revista Perspectiva Educacional, an education journal published by the School of Pedagogy of the Pontifical Catholic University of Valparaíso. It is a project about terminology extraction and term database generation with Termout.org as well as other tools specifically developed for this project.

1 June 2023, 15:45h (France time: GMT+2), online: Rogelio Nazar will be presenting a paper at the TOTH 2023 Conference, to be held at the University Savoie Mont-Blanc (France).

1 June 2023, 16:00 Argentina time (15:00 Chile time), online: Rogelio Nazar, Irene Renau and Nicolás Acosta will give a presentation at the III Jornadas de Corrección de Textos en Español, organized by the Universidad del Salvador (Buenos Aires, Argentina).

17-21 July 2023: Irene Renau and Rogelio Nazar will be presenting two papers at MKR 2023 (the 9th International Conference on Meaning and Knowledge Representation), at Pontificia Universidad Católica de Chile (Santiago, Chile).

30-31 August - 1 September 2023: Irene Renau and Rogelio Nazar will be offering a workshop entitled Corpus Processing for lexicography and terminology, co-located with JELing 2023 (Jornadas de Estudios Lingüísticos), at Facultad de Filosofía y Letras de la Universidad Nacional de Cuyo (Mendoza, Argentina).

27 September - 1 October 2023: Irene Renau and Rogelio Nazar will be presenting two papers at CILH 2023 (Congreso Internacional de Lingüística Hispánica), at Universität Leipzig (Germany).

Latest ideas & research projects

We are developing new projects in computational linguistics and natural language processing:

+ Fondecyt Regular (2023-2027): "Mapa de las metáforas conceptuales en sustantivos y verbos del español: un estudio de los patrones metafóricos basado en corpus" (Map of conceptual metaphors in Spanish nouns and verbs: a corpus-based study of metaphorical patterns). Lead researcher: Irene Renau. Co-researcher: Rogelio Nazar.

+ Fondecyt Regular (2019-2021): "Polisemia regular de los sustantivos del español: análisis semiautomático de corpus, caracterización y tipología" (Regular polysemy of nouns in Spanish: semiautomatic corpus analysis, characterization and typology). Lead researcher: Irene Renau. Ref.: 1191204.

+ Fondecyt Regular (2019-2021): "Inducción automática de taxonomías de marcadores discursivos a partir de corpus multilingües" (Automatic induction of taxonomies of discourse markers from multilingual corpora). Lead researcher: Rogelio Nazar. Ref.: 1191481.

+ Ecos-Sud (International Project between Chile and France): "Inducción automática de taxonomías del español y el francés mediante técnicas cuantitativas y estadística de corpus" (Automatic induction of Spanish and French taxonomies using quantitative techniques and corpus statistics). Lead researcher: Irene Renau. Ref.: C16H02.

+ Fondecyt Regular: "Desarrollo de la competencia terminológica a lo largo de la inserción disciplinar" (Development of terminological competence throughout disciplinary integration). Lead researcher: Sabela Fernández. Co-researcher: Rogelio Nazar. Ref.: 11121597.


Recent publications

+ Robledo, H.; Nazar, R. (2023). A proposal for the inductive categorisation of parenthetical discourse markers in Spanish using parallel corpora. International Journal of Corpus Linguistics. http://doi.org/10.1075/ijcl.20017.rob

+ Renau, I.; Nazar, R. (2022). Towards a multilingual dictionary of discourse markers: automatic extraction of units from parallel corpus. In: Klosa-Kückelhaus, A.; Engelberg, S.; Möhrs, C.; Storjohann, P. (eds.). Dictionaries and Society. Proceedings of the XX EURALEX International Congress, Mannheim: IDS-Verlag, pp. 262-272.

+ Nazar, R.; Lindemann, D. (2022). Terminology extraction using co-occurrence patterns as predictors of semantic relevance. Proceedings of the TERM21 Workshop. Language Resources and Evaluation Conference (LREC 2022), Marseille, 20-25 June 2022, pp. 26-29.

Solutions for text processing

It is critical for organizations to have the ability to process information automatically, and very often that information is contained in documents to be read by humans rather than machines. We have different methods for text processing depending on the goal.

We can help by teaching people how to automate their text processing routines. We can batch-process thousands of documents to extract information from them or to derive different types of statistics. We can also modify these documents, or generate databases or email correspondence based on information extracted from them. Anything that involves intelligent management of information can benefit from different degrees of automation, which frees up time, effort and resources.
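As a rough illustration of what batch processing can look like (a minimal sketch, not one of our production tools), the following Python snippet walks through a folder of plain-text files and derives a few basic statistics for each document. The function name, the folder layout (UTF-8 `.txt` files in a single directory) and the simple tokenization rule are assumptions made for this example.

```python
import re
from collections import Counter
from pathlib import Path


def document_statistics(folder, top_n=3):
    """Return basic statistics for every .txt file in `folder`.

    Hypothetical helper: files are assumed to be UTF-8 plain text, and a
    token is any run of word characters, lowercased.
    """
    stats = {}
    for path in sorted(Path(folder).glob("*.txt")):
        text = path.read_text(encoding="utf-8")
        tokens = re.findall(r"\w+", text.lower())
        frequencies = Counter(tokens)
        stats[path.name] = {
            "tokens": len(tokens),        # running words
            "types": len(frequencies),    # distinct word forms
            "most_common": frequencies.most_common(top_n),
        }
    return stats
```

Calling `document_statistics("my_corpus")` on a folder of your own (the folder name here is only a placeholder) returns a dictionary keyed by file name, which can then be written to a spreadsheet or a database.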

Tell us what your needs are and we will show you what we can do about them.