Automated ICF Coding of Rehabilitation Notes for Low-Resource Languages via Continual Training of Language Models
Roitero, Kevin; Della Mea, Vincenzo
2023-01-01
Abstract
Coding medical documents, and rehabilitation notes in particular, with the International Classification of Functioning, Disability and Health (ICF) is a difficult task on which experts show low agreement. The difficulty stems mainly from the specific terminology the task requires. In this paper, we address the task by developing a model based on a large language model, BERT. By continually training this model on ICF textual descriptions, we are able to effectively encode rehabilitation notes written in Italian, an under-resourced language.

Files in this item:
File | Description | Type | License | Access | Size | Format
---|---|---|---|---|---|---
SHTI-302-SHTI230262.pdf | paper | Publisher's version (PDF) | Creative Commons | open access | 160.97 kB | Adobe PDF