Course:ARST 556L/LIBR 514L/Metadata Topics/Metadata Processing/Town FNS OpenRefine Project

From UBC Wiki
First Nations Summit OpenRefine
Nigel Town
Semester: 2022 Winter Term I
Instructor: Dr. Julia Bullard
Metadata Topic(s)
Metadata Processing
OpenRefine

Summary

This project was a metadata processing exercise carried out on behalf of the First Nations Summit (FNS). The FNS provided me with a sample dataset (an Excel spreadsheet), along with instructions specifying which data they wanted cleaned and how the values in each column should look after transformation. I cleaned the data using OpenRefine and provided a report explaining how to perform each operation I used, as a proof of concept that OpenRefine could be used for this work again in the future.

Purpose/Goal

The goal of this project was twofold: first, to clean the provided dataset in OpenRefine according to the instructions I was given, as a proof of concept that others, such as future students, could perform similar work; and second, to write a tutorial/report documenting the operations I performed to accomplish this.

I was successful: I addressed every stated field need using OpenRefine functionality, returned a cleaned dataset, and wrote a comprehensive report that detailed which actions I performed on which fields and broke down the data of concern.

Lessons Learned

I learned more about how to use OpenRefine to perform specific data transformations. In particular, I learned more about using different expression and programming languages, such as General Refine Expression Language (GREL), Python, and regular expressions, to accomplish these goals. Having used OpenRefine before this project, and taking an introductory Python programming course at the same time as this class, I felt well equipped for the project at the outset. However, one thing that might have helped is more discussion with the person who gave me the data, so that I could have learned more about the context in which the data was generated.
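
To illustrate the kind of regular-expression cleanup mentioned above, here is a minimal Python sketch. It is not one of the actual operations from the FNS report: the function name `clean_cell` and the sample values are hypothetical, and the rule shown (trim the cell and collapse runs of whitespace) is only an assumed example of a typical cleaning step.

```python
import re


def clean_cell(value):
    """Trim a cell and collapse internal runs of whitespace to one space.

    Hypothetical helper for illustration only; it is not taken from the
    project report.
    """
    if value is None:
        # Leave blank cells blank rather than turning them into strings.
        return None
    return re.sub(r"\s+", " ", value).strip()


# Made-up sample values standing in for one spreadsheet column.
sample = ["  First   Nations Summit ", "Vancouver,\tBC", None]
cleaned = [clean_cell(v) for v in sample]
print(cleaned)
```

In OpenRefine itself, the same logic would be written as a cell transformation in which the current cell is available as `value`; in GREL, an expression along the lines of `value.replace(/\s+/, " ").trim()` should have a similar effect.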

Metadata Network

Metadata Topics
Topic Subtopic Student Projects Subtopic Student Projects
Types Administrative Preservation
Technical
Descriptive
Use
Vocabularies* See Metadata Options Table (below)
Design Class_Wiki
Collection
Processing Town_FNS_OpenRefine_Project
Metadata Options
Types of Options Option Student Projects Suboption Student Projects
Applications: OHMS, Oxygen XML Editor, Saxon, Voyager
Controlled Vocabularies*: AAT, CONA, Homosaurus, IA, LCDGT, LCGFT, LCNAF, LCSH, TGN, ULAN, VGMS_Visual_Style, VIAF
Identifiers: DOI, ISBN, ISNI, LCCN, ORCID
Languages: SPARQL, SQL, XML DTD, XQuery, XSLT
Platforms: Fedora
Resources: BIBCO, CONSER, NACO, PCC-LOC, SACO
Schemas: BIBFRAME, DCMI, FRBR, JATS, MARC, MODS, XSD
Standards: AACR2, EAC-CPF, EAD, METS, MIME, OpenURL, PREMIS, RDA, RDF, VRA Core
Systems: DDI, Hyacinth, LibraryWorld, OAI-PMH, OCLC Connexion, RIMS, Worldox
Other: Linked Data, PubMed