Resource
hdl:10032/1b775bfc356df38fba40de0e23e198f7
VU-DNC
TECHNISCH OVERWICHT . Atoombommen niƩt langer schaars
WR-P-P-G_newspapers
174
word tokens
Dutch
nld
Tokenization performed by Kirsten Vis with tool Tadpole
Lemmatization performed by Kirsten Vis with tool Tadpole
POS-tagging performed by Kirsten Vis with tool Tadpole
Annotations for subjectivity added by Kirsten Vis
Quotations annotated by Kirsten Vis
Conversion from original XML to FoLiA XML by Maarten van Gompel with tool vudnc2folia.xslt
OCR output produced by Abbyy Finereader Professional version 8
Automatic Gold standard and OCR alignment by Martin Reynaert with tool Goldie-Oldie
Manual Gold standard and OCR alignment checking and correction by student-assistants
Automatic OCR version inclusion in FoLiA by Martin Reynaert with tool AligntoFoLiA.64.pl
Automatic OCR version inclusion in FoLiA verified and approved by Martin Reynaert with tool CheckAligntoFoLiA.17.pl
Up to three best-ranked spelling correction suggestions automatically added by Martin Reynaert with tool TICCL
Metadata extracted from and updated in FoLiA and CMDI metadata file with normalized metadata created by Martin Reynaert with tool FoLiAtoCMDI.58.pl on the basis of STEVIN SoNaR CMDI template
Resource PIDs provided by CLARIN Centre INL added to CMDI metadata file by Martin Reynaert with tool FoLiAtoCMDI.58.pl
FoLiA XML validation by Martin Reynaert with tool foliavalidator.py