TRADISAN ("conTRAstare la DIsinformazione in ambito SANitario tramite fake news detection sui social media"), a dataset developed for the Italian language to assess health-related news reliability. It consists of 32,101 news. We provide each headline with automatic annotations of 31 news reliability features, including stylometric, lexical and sentiment features. Furthermore, each headline has 4 additional annotations, i.e., lemmas, POS, IOB and NER.
TRADISAN
Luca Giordano;Maria Pia di Buono
2023-01-01
Abstract
TRADISAN ("conTRAstare la DIsinformazione in ambito SANitario tramite fake news detection sui social media"), a dataset developed for the Italian language to assess health-related news reliability. It consists of 32,101 news. We provide each headline with automatic annotations of 31 news reliability features, including stylometric, lexical and sentiment features. Furthermore, each headline has 4 additional annotations, i.e., lemmas, POS, IOB and NER.File in questo prodotto:
Non ci sono file associati a questo prodotto.
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.