Curation module in action - preliminary findings on VLO metadata quality
Report a problem or upload filesIf you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Numerous problems and suggestions have been reported on the issues of metadata aggregation for VLO (Virtual Language Observatory), one of the core services of CLARIN, over the last years. In response to them, we have developed a metadata curation module which is capable of assembling and reporting a wide range of statistics about CMD (Component Metadata) records, collections, and profiles in the aim of monitoring the issues of metadata quality in VLO. In this paper, we present its on-going development and preliminary findings. With an easy-to-use interactive interface and scoring system, the module has successfully demonstrated to visualise the current state of the VLO. Our first set of analysis outlines unprecedented views on the quality of CMD metadata. We have also identified future works including the user interface, usability, input methods, and the calibration of scoring algorithm. We strongly believe that the curation module has a potential to openly and collectively check and improve the metadata, fostering the comprehensive analysis and assessment of metadata quality to support CMDI and VLO in the long run.
Download slides: clarinannualconference2016_durco_curation_module_01.pdf (2.5 MB)
Link this pageWould you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !