Title: Analyse et expansion des textes en question-réponse (Analysis and expansion of texts in question answering)

Authors: Bernard Jacquemin (ISC)
Abstract: This paper presents an original methodology for question answering. We observed that query expansion is often incorrect because the question is poorly understood. Automatic understanding of an utterance depends on the length of its context, and questions are often short. The methodology therefore proposes to analyse the documents and to build an informative structure from the results of that analysis and from a semantic expansion of the text. The linguistic analysis identifies words (tokenization and morphological analysis), the links between words (syntactic analysis), and word senses (semantic disambiguation). The text expansion adds to each word the synonyms that match its sense and replaces words in the utterances with derivatives, modifying the syntactic schema where necessary. In this way, whatever the enrichment, the text keeps the same meaning, but each piece of information has many possible realisations. To answer a question, the method builds a local informative structure for the question, without enrichment, and matches it against the documentary structure. If a sentence in the informative structure of the documents matches the question structure, that sentence is the answer to the question.
Comments: 11 pp
Subjects: Information Retrieval (cs.IR)
ACM classes: H.3; H.4; H.5
Journal reference: Le poids des mots. Actes des 7es journées internationales d'Analyse statistique des Données Textuelles (2004) 1219
Cite as: arXiv:cs/0506047 [cs.IR]
  (or arXiv:cs/0506047v1 [cs.IR] for this version)

Submission history

From: Bernard Jacquemin [via CCSD proxy]
[v1] Sun, 12 Jun 2005 16:39:01 GMT (14kb)
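
The pipeline outlined in the abstract (expand the analysed documents, leave the question unexpanded, then match structures) might be sketched as follows. This is a minimal illustration, not the author's implementation. It assumes that tokenization, parsing and sense disambiguation have already produced (head, relation, dependent) triples of lemmas. The `SYNONYMS` lexicon, the triple format and all function names are hypothetical. The replacement of words by derivatives with a modified syntactic schema is omitted.

```python
from itertools import product

# Hypothetical toy lexicon: lemma -> synonyms for its already disambiguated sense.
SYNONYMS = {
    "buy": {"purchase", "acquire"},
    "firm": {"company"},
}

def expand(word):
    """Return the word together with the synonyms listed for it."""
    return {word} | SYNONYMS.get(word, set())

def expand_sentence(triples):
    """Expand each (head, relation, dependent) triple into all synonym variants."""
    expanded = set()
    for head, rel, dep in triples:
        for h, d in product(expand(head), expand(dep)):
            expanded.add((h, rel, d))
    return expanded

def build_index(sentences):
    """Map each sentence text to its expanded (documentary) structure."""
    return {text: expand_sentence(triples) for text, triples in sentences.items()}

def answer(question_triples, index):
    """Return sentences whose structure covers every question triple.

    The question is not expanded. None in a question triple is a wildcard
    standing for the unknown element; wildcards are matched independently
    for each triple (an assumption of this sketch).
    """
    def covers(structure, q):
        return any(
            all(qv is None or qv == sv for qv, sv in zip(q, s))
            for s in structure
        )
    return [
        text for text, structure in index.items()
        if all(covers(structure, q) for q in question_triples)
    ]

# Assumed output of the linguistic analysis for two document sentences.
sentences = {
    "The firm bought the startup.": [
        ("buy", "subj", "firm"), ("buy", "obj", "startup"),
    ],
    "The startup hired engineers.": [
        ("hire", "subj", "startup"), ("hire", "obj", "engineer"),
    ],
}
index = build_index(sentences)

# "Who purchased the startup?" analysed without enrichment.
question = [("purchase", "subj", None), ("purchase", "obj", "startup")]
print(answer(question, index))  # prints ['The firm bought the startup.']
```

Because the synonyms are attached on the document side, the question word "purchase" matches the sentence containing "bought" without any expansion of the short question itself.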
