Course:ARST 556L/LIBR 514L/Metadata Topics/Metadata Processing/Town FNS OpenRefine Project
First Nations Summit OpenRefine | |
---|---|
Nigel Town | |
Semester: | 2022 Winter Term I |
Instructor: | Dr. Julia Bullard |
Metadata Topic(s) | |
Metadata Processing | |
OpenRefine |
Summary
This project was a metadata processing exercise on behalf of the First Nations Summit. The FNS provided me with a sample dataset (Excel spreadsheet) along with instructions for what data they wanted cleaned and what they wanted the values in each column to look like after they had been transformed. I cleaned the data using OpenRefine and provided a report with instructions on how to perform the operations I used as a proof-of-concept that OpenRefine could be used to do this again in the future.
Purpose/Goal
The goal of this project was twofold: to clean the provided dataset according to the instructions provided using OpenRefine as a proof-of-concept that others, like future students, could perform similar work in the future, and to write a tutorial/report of which operations I performed to accomplish this.
I was successful: I addressed every stated field need using OpenRefine functionality, returned a cleaned dataset, and wrote a comprehensive report detailing which actions I performed for which fields and broke down data of concern.
Lessons Learned
I learned more about how to use OpenRefine to accomplish specific transformations to data. Specifically, I learned more about how to use different expression/programming languages like General Refine Expression Language, Python, and Regular Expressions to accomplish these goals. Having used OpenRefine software before this project and taking an introductory Python Programming course while taking this class, I felt I was well-equipped for this project at the outset. However, one thing that might have helped is more discussion with the person who gave me this data and gotten to know more about the context that the data was generated within.
Metadata Network
Topic | Subtopic | Student Projects | Subtopic | Student Projects | ||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
Types | Administrative | Preservation | ||||||||||
Technical | ||||||||||||
Descriptive | ||||||||||||
Use | ||||||||||||
Vocabularies* | See Metadata Options Table (below) | |||||||||||
Design | Class_Wiki | |||||||||||
Collection | ||||||||||||
Processing | Town_FNS_OpenRefine_Project |
Types of Options | Option | Student Projects | Suboption | Student Projects | |||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Applications | OHMS | ||||||||||
Oxygen XML Editor | |||||||||||
Saxon | |||||||||||
Voyager | |||||||||||
Controlled Vocabularies* | AAT | ||||||||||
CONA | |||||||||||
Homosaurus | |||||||||||
IA | |||||||||||
LCDGT | |||||||||||
LCGFT | |||||||||||
LCNAF | |||||||||||
LCSH | |||||||||||
TGN | |||||||||||
ULAN | |||||||||||
VGMS_Visual_Style | |||||||||||
VIAF | |||||||||||
Identifiers | DOI | ||||||||||
ISBN | |||||||||||
ISNI | |||||||||||
LCCN | |||||||||||
ORCID | |||||||||||
Languages | SPARQL | ||||||||||
SQL | |||||||||||
XML | DTD | ||||||||||
XQuery | |||||||||||
XSLT | |||||||||||
Platforms | Fedora | ||||||||||
Resources | BIBCO | ||||||||||
CONSER | |||||||||||
NACO | |||||||||||
PCC-LOC | |||||||||||
SACO | |||||||||||
Schemas | BIBFRAME | ||||||||||
DCMI | |||||||||||
FRBR | |||||||||||
JATS | |||||||||||
MARC | |||||||||||
MODS | |||||||||||
XSD | |||||||||||
Standards | AACR2 | ||||||||||
EAC-CPF | |||||||||||
EAD | |||||||||||
METS | |||||||||||
MIME | |||||||||||
OpenURL | |||||||||||
PREMIS | |||||||||||
RDA | |||||||||||
RDF | |||||||||||
VRA Core | |||||||||||
Systems | DDI | ||||||||||
Hyacinth | |||||||||||
LibraryWorld | |||||||||||
OAI-PMH | |||||||||||
OCLC Connexion | |||||||||||
RIMS | |||||||||||
Worldox | |||||||||||
Other | Linked Data | ||||||||||
PubMed |