DIMMI - Drug InforMation Mining in Italian

Raffaele Manna; Maria Pia di Buono; Luca Giordano
2024-01-01

Abstract

DIMMI consists of 600 Italian drug package leaflets. Document lengths vary widely, from 363 tokens for the shortest document to 11,730 tokens for the longest. The DIMMI dataset is derived from the D-LeafIT Corpus, which comprises 1,819 Italian drug package leaflets. The corpus was created by extracting the PILs available from the Italian Medicines Agency (Agenzia Italiana del Farmaco, AIFA); of these, 1,439 refer to generic drugs and 380 to class A drugs.

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11574/237283