Korpus ręcznie sklasyfikowanych komentarzy do uczenia maszynowego (filtrowanie komentarzy obraźliwych)
-
Updated
Aug 2, 2022 - Python
Korpus ręcznie sklasyfikowanych komentarzy do uczenia maszynowego (filtrowanie komentarzy obraźliwych)
Public Domain Words and Texts for Conlangs
Scripts de bots, web scrappings e web crawlers para pesquisa.
Emacs Lisp corpus. Code collected from many-many projects for you to query it!
Create a wiki corpus using a wiki dump file for Natural Language Processing
Spam-ham-Classification
Asturian language corpus for FreeLing
A corpus for the Zazaki and Gorani languages
Discursos presidenciales de Latinoamérica en español
🌐 ANT Corpus website repository.
Data for HindiRC
Data pipeline for the coco-explorer app.
For a corpus linguistics project, I created an information retrieval program called "You Are Not Alone". My phrase_finder() function searches for a self-identifying phrase in 4 large classic texts (The Souls of Black Folk, Jane Eyre, The Strange Case of Dr. Jekyll & Mr. Hyde, and Frankenstein). Standpoint: "So Matilda’s strong young mind continu…
Repository of data and results for an undergraduate thesis titled "A Corpus-Based Study to Triangulating Experimental Evidence Regarding Verb-Noun Association for Action Verbs" by I Gede Semara Dharma Putra.
Estonian TIMEX Annotated Corpora \ Eesti keele ajaväljendimärgendustega korpused
Tidy concordances, collocates, and wordlist
Collection of open source javascript projects
Repositório para disponibilização de bases de dados do Wikipedia e Simple Wikipedia pré-processadas, além de scripts de pré-processamento e geração de bases em Python.
Add a description, image, and links to the corpus-data topic page so that developers can more easily learn about it.
To associate your repository with the corpus-data topic, visit your repo's landing page and select "manage topics."