Title: UTMN at SemEval-2020 Task 11: A Kitchen Solution to Automatic Propaganda Detection

Abstract: The article describes a fast solution to propaganda detection at SemEval-2020 Task 11, based on feature adjustment. We use per-token vectorization of features and a simple Logistic Regression classifier to quickly test different hypotheses about our data. We arrive at what seems to us the best solution; however, we are unable to align it with the result of the metric suggested by the organizers of the task. We test how our system handles class and feature imbalance by varying the number of samples of the two classes (Propaganda and None) in the training set, the size of the context window in which a token is vectorized, and the combination of vectorization means. The result of our system at SemEval-2020 Task 11 is F-score=0.37.
Comments: 5 pages -- the article proper; 2 pages -- references; 3 figures
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as: arXiv:2008.09869 [cs.CL]
  (or arXiv:2008.09869v1 [cs.CL] for this version)
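
The abstract mentions per-token vectorization within a context window. As a rough illustration only, the sketch below shows one way such a window could be turned into per-token features. The function names, the `<pad>` symbol, and the choice of lowercased surface forms as features are all assumptions made for this example; the abstract does not specify the authors' actual feature set.

```python
from typing import Dict, List

PAD = "<pad>"  # hypothetical placeholder for positions outside the text


def window_features(tokens: List[str], index: int, window: int) -> Dict[str, str]:
    """Describe tokens[index] by the lowercased tokens within `window` positions of it."""
    features: Dict[str, str] = {}
    for offset in range(-window, window + 1):
        position = index + offset
        inside = 0 <= position < len(tokens)
        # Key each feature by its signed offset from the target token.
        features[f"tok[{offset:+d}]"] = tokens[position].lower() if inside else PAD
    return features


def featurize(tokens: List[str], window: int) -> List[Dict[str, str]]:
    """Build one feature dictionary per token in the sequence."""
    return [window_features(tokens, i, window) for i in range(len(tokens))]


rows = featurize(["We", "must", "act"], window=1)
```

Feature dictionaries of this kind could then be one-hot encoded (for instance with scikit-learn's `DictVectorizer`) and passed to a `LogisticRegression` classifier with one Propaganda/None label per token. Changing `window` is the kind of context-size variation the abstract refers to.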

Submission history

From: Nadezhda Ganzherli
[v1] Sat, 22 Aug 2020 16:31:01 GMT (304kb)