Resource
hdl:10032/092a8ae1decb5d255d6c8b4d1f070005
VU-DNC
Strijd op de Champs Elysées over Skorzeny in de Figaro (English: Fight on the Champs Elysées over Skorzeny in Le Figaro)
WR-P-P-G_newspapers
223 word tokens
Dutch (nld)
Tokenization performed by Kirsten Vis with tool Tadpole
Lemmatization performed by Kirsten Vis with tool Tadpole
POS-tagging performed by Kirsten Vis with tool Tadpole
Annotations for subjectivity added by Kirsten Vis
Quotations annotated by Kirsten Vis
Conversion from original XML to FoLiA XML by Maarten van Gompel with tool vudnc2folia.xslt
OCR output produced by ABBYY FineReader Professional version 8
Automatic gold standard and OCR alignment by Martin Reynaert with tool Goldie-Oldie
Manual gold standard and OCR alignment checking and correction by student-assistants
Automatic OCR version inclusion in FoLiA by Martin Reynaert with tool AligntoFoLiA.64.pl
Automatic OCR version inclusion in FoLiA verified and approved by Martin Reynaert with tool CheckAligntoFoLiA.17.pl
Up to three best-ranked spelling correction suggestions automatically added by Martin Reynaert with tool TICCL
Metadata extracted from and updated in FoLiA; CMDI metadata file with normalized metadata created by Martin Reynaert with tool FoLiAtoCMDI.58.pl on the basis of the STEVIN SoNaR CMDI template
Resource PIDs provided by CLARIN Centre INL added to CMDI metadata file by Martin Reynaert with tool FoLiAtoCMDI.58.pl
FoLiA XML validation by Martin Reynaert with tool foliavalidator.py