
PLOS Digital Health, Journal Year: 2025, Volume and Issue: 4(3), P. e0000765 - e0000765
Published: March 19, 2025
Background: The use of social media platforms in health research is increasing, yet their application in studying rare diseases is limited. Hodgkin’s lymphoma (HL) is a malignancy with a high incidence in young adults. This study evaluates the feasibility of using social media data to study the disease and treatment characteristics of HL.
Methods: We utilized the X (formerly Twitter) API v2 developer portal to download posts (formerly tweets) from January 2010 to October 2022. Annotation guidelines were developed from the literature, and a manual review of a limited set of posts was performed to identify the class attributes (characteristics) of HL discussed on X and to create a gold standard dataset. The gold standard dataset was subsequently employed to train, test, and validate a Named Entity Recognition (NER) Natural Language Processing (NLP) application.
Results: After data preparation, 80,811 posts were collected: 500 for annotation guideline development, 2,000 for NLP application development, and the remaining 78,311 for deploying the application. We identified nine classes related to HL, such as classification, etiopathology, stages and progression, and treatment. Treatment and progression were the most frequently discussed, with 20,013 (25.56%) posts mentioning HL’s treatments and 17,177 (21.93%) mentioning HL’s progression. The model exhibited robust performance, achieving 86% accuracy and an 87% F1 score. The etiopathology class demonstrated excellent performance, with 93% accuracy and a 95% F1 score.
Discussion: The NLP application displayed efficacy in extracting and characterizing HL-related information from social media posts, as evidenced by its accuracy and F1 score. Nonetheless, it presented limitations in distinguishing between patients, providers, and caregivers and in establishing temporal relationships between attributes. Further research is necessary to bridge these gaps.
Conclusion: Our study demonstrated the potential of social media data as a valuable preliminary source for understanding the characteristics of Hodgkin’s Lymphoma.
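The counts reported in the Results can be cross-checked with a few lines of arithmetic. The sketch below is not the authors' code: the names `SPLIT`, `MENTIONS`, `share_of`, and `f1_score` are hypothetical, and it assumes the reported percentages use the 78,311 deployment posts as the denominator, an assumption the check appears to bear out. The F1 helper shows only the standard definition (harmonic mean of precision and recall); the abstract does not report precision or recall, so no study values are passed to it.

```python
# Post counts as reported in the abstract.
TOTAL_POSTS = 80_811
SPLIT = {
    "guideline_development": 500,
    "nlp_development": 2_000,
    "deployment": 78_311,
}

# Posts mentioning each class, as reported in the abstract.
MENTIONS = {"treatment": 20_013, "progression": 17_177}


def share_of(count, denominator):
    """Return count as a percentage of denominator, rounded to 2 decimals."""
    return round(100 * count / denominator, 2)


def f1_score(precision, recall):
    """Standard F1: harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    # The three subsets should add up to the total collected.
    print(sum(SPLIT.values()) == TOTAL_POSTS)
    # Assumption: percentages are relative to the deployment set.
    for label, count in MENTIONS.items():
        print(label, share_of(count, SPLIT["deployment"]))
```

Under that denominator assumption, the computed shares reproduce the 25.56% and 21.93% figures given in the abstract, and the three subsets sum to the 80,811 posts collected.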
Language: English