GEOLSemantics develops its multilingual semantic extraction technology by participating to research and development projects in partnership at the national and European level.
SAIMSI project :
The SAIMSI project aims at realizing a system prototype which would accumulate the structured information about the actions of people suspected of illicit activities.
To know more..
The information extracted from the various languages is represented using the standards of the semantic Web (RDF) independently from the languages and compliant with an ontology of the safety elaborated within the framework of the project. English was chosen to represent concepts and relations.
The collected information is managed within two databases: one knowledge base containing the structured information from the different documents and a textual database searchable in interlingua and containing the source documents. During the display of text in the textual DB you may ask for structured information from the knowledge base on a quoted entity (person, location, company…). Conversely, for every information of the knowledge base you may recover all original documents in the textual DB.
- Extraction system for Personal Attributes Extraction of CLP2014, The Third CIPS-SIGHAN Joint Conference on Chinese Language Processing, Wuhan, China, October 2014
- Une approche linguistique pour l’extraction des connaissances dans un texte arabe, colloque TALN-RÉCITAL, Les Sables d’Olonne, juin 2013
- Une approche mixte morpho-syntaxique et statistique pour la reconnaissance d’entités nommées en langue chinoise, colloque TALN-RÉCITAL, Les Sables d’Olonne, juin 2013
- SAIMSI, Suivi Adaptatif Interlingue et MultiSources des Informations, colloque WISG2013, Troyes, janvier 2013
- Using Arabic Transliteration to Improve Word Alignment from French – Arabic Parallel Corpora, The Fourth Workshop on Computational Approaches to Arabic Script-based Languages, San Diego, California, November 2012
- Extraction of information on activities of persons suspected of illegal activities from web open sources, colloque Language Resources for Public Security Applications, Istanbul, Turquie, mai 2012
- Transcription of Arabic Names into Latin, colloque Sciences of Electronics, Technologies of Information and Telecommunications, Sousse, Tunisia, March 2012
ORELO project :
ORELO aims at working out identification techniques of Arabic dialectal origin of a text written in Arabic or Latin characters or a quote. The dialects considered by the project are the main dialects of the Maghreb (Moroccan, Algerian, Tunisian) and the Egyptian. To know more..
- CODA : A conventional orthography for Algerian Arabic, Arabic Natural Language Processing Workshop, Beijing, July 2015
- La reconnaissance automatique des dialectes arabes à l’écrit, Colloque international traduction et champs connexes, quelle place pour la langue arabe aujourd’hui?, Alger, 18-20 décembre 2013
- Une approche linguistique pour la détection des dialectes arabes, 24e Conférence sur le Traitement Automatique des Langues Naturelles (TALN) p. 242, Orléans, Juin 2017
DRIRS project :
The DRIRS project aims to identify activities about promoting radical ideas on social networks, spot influences and establish circles of probable recruits. This is the upstream activity of radicalization that uses unencrypted networks to reach the maximum audience.
- Automatic Identification of Maghreb Dialects Using a Dictionary-Based Approach, In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Miyazaki, Japan, may 2018
- Approche Hybride pour la translitération de l’Arabizi Algérien: Une enquête préliminaire, In 25e Conférence sur le Traitement Automatique des Langues Naturelles (TALN) p. 509-517, Rennes, mai 2018
- Arabic Natural Language Processing: an overview. In Journal of King Saud University – Computer and Information Sciences. King Saud University (2019).
Other published papers :
- Uncertainty Evaluation in Textual Document, 11th International Workshop on Uncertainty Reasoning for the Semantic Web (URSW), ISWC 2015 workshop, Bethlehem, Pennsylvania , October 2015
- RDF Knowledge Graph Visualization From a Knowledge Extraction System, Summarizing and Presenting Entities and Ontologies (SumPre), ESWC 2015 workshop, Portoroz, Slovénia , May 2015
- Gestion de l’incertitude dans le cadre d’une extraction des connaissances à partir de texte, 12ème atelier sur la Fouille de Données Complexes (FDC) Extraction et Gestion des Connaissances (EGC 2015), Luxembourg, janvier 2015
- Synthèse de concepts formels par réécriture à partir d’une ontologie client, 13ème Conférence Francophone sur l’Extraction et la Gestion des Connaissances (EGC 2013), Toulouse, janvier 2013