These tools will no longer be maintained as of December 31, 2024. Archived website can be found here. PubMed4Hh GitHub repository can be found here. Contact NLM Customer Service if you have questions.
169 related articles for article (PubMed ID: 32671155)
1. Enhancing African low-resource languages: Swahili data for language modelling. Shikali CS; Mokhosi R Data Brief; 2020 Aug; 31():105951. PubMed ID: 32671155 [TBL] [Abstract][Full Text] [Related]
2. Enhancing text pre-processing for Swahili language: Datasets for common Swahili stop-words, slangs and typos with equivalent proper words. Masua B; Masasi N Data Brief; 2020 Dec; 33():106517. PubMed ID: 33294515 [TBL] [Abstract][Full Text] [Related]
3. In the heart of Swahili: An exploration of data collection methods and corpus curation for natural language processing. Masua B; Masasi N Data Brief; 2024 Aug; 55():110751. PubMed ID: 39234059 [TBL] [Abstract][Full Text] [Related]
4. Building lexicon-based sentiment analysis model for low-resource languages. Mohammed I; Prasad R MethodsX; 2023 Dec; 11():102460. PubMed ID: 38023300 [TBL] [Abstract][Full Text] [Related]
5. A Sesotho news headlines dataset for sentiment analysis. Mokhosi R; Shivachi CS; Sethobane M Data Brief; 2024 Jun; 54():110371. PubMed ID: 38590621 [TBL] [Abstract][Full Text] [Related]
6. Swahili speech development: preliminary normative data from typically developing pre-school children in Tanzania. Gangji N; Pascoe M; Smouse M Int J Lang Commun Disord; 2015; 50(2):151-64. PubMed ID: 25134791 [TBL] [Abstract][Full Text] [Related]
7. IndicDialogue: A dataset of subtitles in 10 Indic languages for Indic language modeling. Arnob NMK; Faiyaz A; Fuad MM; Al Masud SMR; Das B; Mridha MF Data Brief; 2024 Aug; 55():110690. PubMed ID: 39109169 [TBL] [Abstract][Full Text] [Related]
8. A comparison of word embeddings for the biomedical natural language processing. Wang Y; Liu S; Afzal N; Rastegar-Mojarad M; Wang L; Shen F; Kingsbury P; Liu H J Biomed Inform; 2018 Nov; 87():12-20. PubMed ID: 30217670 [TBL] [Abstract][Full Text] [Related]
9. Parallel texts dataset for Uzbek-Kazakh machine translation. Allaberdiev B; Matlatipov G; Kuriyozov E; Rakhmonov Z Data Brief; 2024 Apr; 53():110194. PubMed ID: 38425874 [TBL] [Abstract][Full Text] [Related]
10. BengSentiLex and BengSwearLex: creating lexicons for sentiment analysis and profanity detection in low-resource Bengali language. Sazzed S PeerJ Comput Sci; 2021; 7():e681. PubMed ID: 34901419 [TBL] [Abstract][Full Text] [Related]
11. Text data augmentation and pre-trained Language Model for enhancing text classification of low-resource languages. Ziyaden A; Yelenov A; Hajiyev F; Rustamov S; Pak A PeerJ Comput Sci; 2024; 10():e1974. PubMed ID: 38660166 [TBL] [Abstract][Full Text] [Related]
13. Improving Loanword Identification in Low-Resource Language with Data Augmentation and Multiple Feature Fusion. Mi C; Zhu S; Nie R Comput Intell Neurosci; 2021; 2021():9975078. PubMed ID: 33927756 [TBL] [Abstract][Full Text] [Related]
14. Deep learning based sentiment analysis and offensive language identification on multilingual code-mixed data. Shanmugavadivel K; Sathishkumar VE; Raja S; Lingaiah TB; Neelakandan S; Subramanian M Sci Rep; 2022 Dec; 12(1):21557. PubMed ID: 36513786 [TBL] [Abstract][Full Text] [Related]
15. Dataset for Siswati: Parallel textual data for English and Siswati and monolingual textual data for Siswati. Gaustad T; McKellar CA; Puttkammer MJ Data Brief; 2024 Jun; 54():110325. PubMed ID: 38617020 [TBL] [Abstract][Full Text] [Related]
16. SNLI Indo: A recognizing textual entailment dataset in Indonesian derived from the Stanford Natural Language Inference dataset. Putra IMS; Siahaan D; Saikhu A Data Brief; 2024 Feb; 52():109998. PubMed ID: 38235176 [TBL] [Abstract][Full Text] [Related]
17. Sentiment analysis techniques, challenges, and opportunities: Urdu language-based analytical study. Liaqat MI; Awais Hassan M; Shoaib M; Khurshid SK; Shamseldin MA PeerJ Comput Sci; 2022; 8():e1032. PubMed ID: 36091980 [TBL] [Abstract][Full Text] [Related]
18. Cursive-Text: A Comprehensive Dataset for End-to-End Urdu Text Recognition in Natural Scene Images. Chandio AA; Asikuzzaman M; Pickering M; Leghari M Data Brief; 2020 Aug; 31():105749. PubMed ID: 32490098 [TBL] [Abstract][Full Text] [Related]
19. Recall and response time norms for English-Swahili word pairs and facts about Kenya. Bangert AS; Heydarian NM Behav Res Methods; 2017 Feb; 49(1):124-171. PubMed ID: 26822669 [TBL] [Abstract][Full Text] [Related]
20. BanglaSER: A speech emotion recognition dataset for the Bangla language. Das RK; Islam N; Ahmed MR; Islam S; Shatabda S; Islam AKMM Data Brief; 2022 Jun; 42():108091. PubMed ID: 35392615 [TBL] [Abstract][Full Text] [Related] [Next] [New Search]