DIMMI - Drug InforMation Mining in Italian
Raffaele Manna;Maria Pia di Buono;Luca Giordano
2024-01-01
Abstract
DIMMI consists of 600 Italian drug package leaflets. Document lengths vary widely, from 363 tokens for the shortest to 11,730 tokens for the longest. The DIMMI dataset is derived from the D-LeafIT Corpus, which comprises 1,819 Italian drug package leaflets. The corpus was created by extracting the patient information leaflets (PILs) available from the Italian Medicines Agency (Agenzia Italiana del Farmaco, AIFA); of these 1,819 leaflets, 1,439 refer to generic drugs and 380 to class A drugs.