Taalmaterialen

/ Taalmaterialen /

Via de website taalmaterialen.ivdnt.org stelt het Instituut voor de Nederlandse Taal (INT) bronnen, data en tools beschikbaar voor taalkundig onderzoek en taal- en spraaktechnologie (TST) binnen het hele Nederlandse taalgebied.

Overzicht van alle taalmaterialen

Afrikaans Custom Dictionary for Government Domain
Afrikaans Genre Classification Corpus
AI-Trainingset - Tag de Tekst voor Named Entity Recognition (NER)
Algemeen Nederlands Woordenboek (ANW)
Algemene Nederlandse Spraakkunst (e-ANS)
Annotated Corpora for Term Extraction Research (ACTER)
Annotated Corpora for Term Extraction Research (ACTER)
Attestation Tool
AUTONOMATA-namencorpus
AUTONOMATA-namencorpus Commercieel
AUTONOMATA-POI-corpus
AUTONOMATA-POI-demo
AUTONOMATA-transcriptietoolset
AutoSearch
Autshumato Afrikaans-English Translation Memory
Autshumato English-Afrikaans Parallel Corpora
Autshumato English-Afrikaans Translation Memory
Autshumato English-isiZulu Parallel Corpora
Autshumato English-isiZulu Translation Memory
Autshumato English-Sesotho sa Leboa Parallel Corpora
Autshumato isiZulu-English Translation Memory
Autshumato Sesotho sa Leboa-English Translation Memory
BasiLex-corpus
BasiLex-corpus Commercieel
BasiLex-lexicon
BasiLex-lexicon Commercieel
BasiScript-corpus
BasiScript-corpus Commercieel
BasiScript-lexicon
BasiScript-lexicon Commercieel
Belgian Covid Sign Language Corpus (BeCoS Corpus)
BlackLab
BlackLab Frontend
BLISS Dialogue Summaries
BLISS Spoken Dialogue Dataset
Brieven als Buit (Online)
Brieven als Buit-2 (Online)
Brieven als Buit - Gouden Standaard
Cd-rom Middelnederlands
CELEX-2 Dutch
CGN-annotaties
CGN-annotaties Commercieel
Children's Oral Reading Corpus (CHOREC)
CHN N-grams
CHN N-grams Commercieel
CoBaLT
CombiLex
CombiLex Commercieel
COREA-coreferentiecorpus
COREA-coreferentiecorpus Commercieel
COREA-coreferentieservice
Cornetto-LMF
Corpus Gesproken Nederlands (CGN)
Corpus Gesproken Nederlands (CGN) Commercieel
Corpus Gysseling (Data)
Corpus Gysseling (Online)
Corpus Hedendaags Nederlands - CHN (Online)
Corpus Juridisch Nederlands (Online)
Corpus Middelnederlands (Data)
Corpus Middelnederlands (Online)
Corpus Ondertitelde UVN-Colleges (COUC)
Corpus Oudnederlands (Online)
Corpus Pathologische en Normale Spraak (COPAS)
Corpus Vlaamse Gebarentaal (Corpus VGT)
Couranten Corpus (Online)
D-TUNA-corpus
DAESO-corpus: parallelle Nederlandstalige monolinguale treebank
DAESO-corpus: parallelle Nederlandstalige monolinguale treebank Commercieel
Database van de Zuidelijk-Nederlandse Dialecten (DSDD)
Diachroon seMantisch lexicon van de Nederlandse Taal - DiaMaNT
DuELME
DuELME Commercieel
DuOMAn Subjectivity Lexicon
DuOMAn Subjectivity Lexicon Commercieel
Dupira
Dutch Idiom Database: Native Speakers (DID-NS)
Dutch Idiom Database: Native Speakers (DID-NS) Commercieel
Dutch Parallel Corpus (DPC)
Dutch Parallel Corpus (DPC) Commercieel
e-Lex
e-Lex Commercieel
Eindhoven-corpus
Etymologiebank
Etymologisch Woordenboek van het Nederlands (EWN)
Federated Search Lexica
Frequentielijsten corpora
Frequentielijsten corpora Commercieel
Frog
Gekaapte Brieven (Online)
GiGaNT-Molex
GiGaNT-Molex Commercieel
Gold Standard Parallel Corpus of Sign and spoken Language (GoSt-ParC-Sign)
Greedy Extraction of Trees for Empirical Linguistics - GrETEL
Hoger Onderwijs Terminologie in Nederland en Vlaanderen (HOTNeV)
Hotel Review Corpus in Nederlandse Gebarentaal (NGT_HoReCo)
Hotel Review Corpus in Spanish Sign Language (LSE_HoReCo)
Hotel Review Corpus in Vlaamse Gebarentaal (VGT_HoReCo)
Hulk / Keurmerk Spelling
IFA-corpus
IFA Dialoog-videocorpus
INL Labs

INT Historische Woordenlijst
INT IMPACT NE-lexicon
isiNdebele Custom Dictionary for Government Domain
isiNdebele Genre Classification Corpus
isiXhosa Custom Dictionary for Government Domain
isiXhosa Genre Classification Corpus
isiZulu Custom Dictionary for Government Domain
isiZulu Genre Classification Corpus
JASMIN-spraakcorpus
JASMIN-spraakcorpus Commercieel
Lassy Groot-corpus
Lassy Groot-corpus Commercieel
Lassy Klein-corpus
Lassy Klein-corpus Commercieel
Lwazi Afrikaans ASR Corpus
Lwazi Afrikaans Pronunciation Dictionary
Lwazi English ASR Corpus
Lwazi English Pronunciation Dictionary
Lwazi isiNdebele ASR Corpus
Lwazi isiNdebele Pronunciation Dictionary
Lwazi isiXhosa ASR Corpus
Lwazi isiXhosa Pronunciation Dictionary
Lwazi isiZulu ASR Corpus
Lwazi isiZulu Pronunciation Dictionary
Lwazi Sepedi ASR Corpus
Lwazi Sepedi Pronunciation Dictionary
Lwazi Sesotho ASR Corpus
Lwazi Sesotho Pronunciation Dictionary
Lwazi Setswana ASR Corpus
Lwazi Setswana Pronunciation Dictionary
Lwazi Siswati ASR Corpus
Lwazi Siswati Pronunciation Dictionary
Lwazi Tshivenda ASR corpus
Lwazi Tshivenda Pronunciation Dictionary
Lwazi Xitsonga ASR Corpus
Lwazi Xitsonga Pronunciation Dictionary
Medische Pilot (MedPilot)
Medische Termen Belgisch-Nederlands (MedTermBN)
Medische Termen Belgisch-Nederlands (MedTermBN) Commercieel
Meertalige Ondertiteldata 2BDutch
Memory-Based Morphological Parser (MBMP)
Meta-Covid Ontology 1.0
Middelnederlandsch Woordenboek (MNW)
Moroccorp
MuST-Cinema-PE: post-editing in automatic subtitling
NAMES Corpus
NAMES Corpus Commercieel
Nederlands als Wetenschapstaal: Scheikunde
Nederlands als Wetenschapstaal: Wiskunde
OMBI Arabisch-Nederlands
OMBI Arabisch-Nederlands Commercieel
OMBI Nederlands-Arabisch
OMBI Nederlands-Arabisch Commercieel
OMBI Nederlands-Deens
OMBI Nederlands-Deens Commercieel
OMBI Nederlands-Indonesisch
OMBI Nederlands-Indonesisch Commercieel
Open Dutch Wordnet
OpenSoNaR
Oudnederlands Woordenboek (ONW)
Paco-MT Parallelle Corpora
PAROLE-lexicon
Philosophical Integrator of Computational and Corpus Libraries (PICCL)
Pinkhof Geneeskundig Woordenboek (Online)
Referentiebestand Belgisch-Nederlands (RBBN)
Referentiebestand Belgisch-Nederlands (RBBN) Commercieel
Referentiebestand Nederlands (RBN)
Referentiebestand Nederlands (RBN) Commercieel
RND Woordenlijsten
Sepedi Custom Dictionary for Government Domain
Sesotho Custom Dictionary for Government Domain
Sesotho Genre Classification Corpus
Sesotho sa Leboa Genre Classification Corpus
Setswana Custom Dictionary for Government Domain
Setswana Genre Classification Corpus
Siswati Custom Dictionary for Government Domain
Siswati Genre Classification Corpus
SoNaR-corpus
SoNaR Character n-grams
SoNaR Groot-corpus Commercieel
SoNaR Klein-corpus Commercieel
SoNaR Nieuwe Media Corpus
SumNL-samenvattingencorpus
SumNL-samenvattingencorpus Commercieel
Taalportaal
The LiLaH Emotion Lexicon of Greek, Kurdish, Turkish, Spanish, Farsi and Chinese
Tshivenda Custom Dictionary for Government Domain
Tshivenda Genre Classification Corpus
Vertaalwoordenschat
Vroegmiddelnederlands Woordenboek (VMNW)
VU-DNC-corpus
Wablieft-corpus
WAI-NOT Corpus
WebCelex
Woordcombinaties (Online)
Woordenboek der Friese Taal (WFT)
Woordenboek der Nederlandsche Taal (WNT)
Woordenboek Vlaamse Gebarentaal (Woordenboek VGT)
Xitsonga Custom Dictionary for Government Domain
Xitsonga Genre Classification Corpus

Overzicht van alle taalmaterialen

INT-nieuwsbrief

Terminologienieuwsbrief