Background

Background

Ryan E. Johnson is a Metadata and Data Curation Specialist with UC San Diego’s Geisel Library. He works on the metadata for digital collection in the UC San Diego Digital Collections, also known as the DAMS (Digital Asset Management System). Besides working with Special Collections and the Digital Library Programs, he is a member of the Research Data Curation Program, where he provides expertise in metadata and solicits information from UCSD researchers about their data.

Originally from Los Angeles, Ryan received his Master’s from Syracuse University in Library and Information Science, with a specialization in Digital Libraries. While currently preoccupied with the world of metadata, he is interested in data cleaning and transformation, addressing the challenges of publishing Linked Data, and collaborating with library information technology professionals to build software that provides users with excellent information.

Experience

Metadata Librarian and Data Curation Specialist, UC San Diego
2014 - current

As a member of the Digital Object Metadata Management Unit in the Geisel Library, I clean, map, and transform metadata from data providers to align with the data model of our DAMS (Digital Asset Management System). I am the Metadata Analyst on about a dozen digital collections, ranging from Special Collections papers to protein sequencing analysis research. I helped to refine and plan new data models for the DAMS that better take advantage of Linked Data, best practices, and decisions that communities (for example the Hydra and Fedora communities) have agreed upon.

I also helped streamline digital collection publication, which went from a project-specific customization that took on average many months (sometimes a year) to a standard process taking mere weeks. This streamlining required planning and collaborating with IT to develop ingest tools that could handle MARC, AT, and Excel source metadata into what we refer to as ‘input streams’.

For context, the UC San Diego DAMS is composed of a triplestore backend (currently undergoing a transition to Fedora 4), Hydra, Blacklight and Solr. The metadata I most commonly work with is serialized in RDF/XML, but I am comfortable in many different serializations of RDF.

Metadata Assistant, Hamilton College
2011-2013

As part of the Digital Humanities Initiative (DHi), I provided metadata expertise to faculty’s Digital Humanities projects, which included work with MODS, RDF/XML, and encoding text into TEI. Hamilton was one of the early adopters of Islandora and the Fedora/Islandora/Drupal stack. I helped develop metadata entry forms, based on an Islandora module, aimed at researchers for self-deposit of their metadata, which required XPath/XQuery knowledge and collaboration with the Islandora community.

Metadata Assistant, Cornell Institute for Social and Economic Research (CISER)
Summer 2011
As an intern, I was tasked with providing metadata for CISER’s burgeoning online data portal. Harnessing the physical codebooks on site, and reconciling to online sources like ICPSR. During my short internship, I vastly increased the amount of metadata present in the repository.

Quality Assurance Tester, PC Games Division, THQ Inc.
2006-2007
As a quality assurance tester for the now-defunct video games publisher THQ, I tested PC games and logged any errors in a database, taking note of local environment and steps to reproduce the bugs. During ‘crunch’ times, I worked overtime and even doubletime in order to meet publishing deadlines. I was offered a permanent position, but declined in order to return to my education.

Education

Syracuse University
M.S., Library and Information Science (2012)
2010-2012

California State University, Northridge (2009)
B.A., Interdisciplinary Humanities
2008-2009

University of California, Santa Barbara
2000-2005

Other Professional Activities

Technical Skills and Expertise

  • Expertise in multiple metadata standards, including MODS, MADS, METS, RDF (and serializations), EAD, schema.org, as well as knowledge of application profiles
  • Experience creating metadata standards, ontologies, application profiles, and data modeling
  • Knowledge of linked data theories and methods, especially as it relates to library data
  • Data cleaning and metadata enrichment expertise
  • Data transformation and enhancement through OpenRefine, APIs (parsing JSON and HTML), SPARQL queries, and XSL-T
  • Basic scripting and regular expression knowledge (Python, shell)
  • Creating static web pages for documentation via GitHub Pages and ReadTheDocs, harnessing Jekyll, Markdown, GitHub, etc for rapid deployment
  • File management and version control software, especially as it relates to data management
  • Basic LAMP (also Nginx) server administration, with PHP and shell scripting relevant to security and automation
  • Experience in the Fedora/Islandora/Drupal stack as well as the Fedora/Hydra/Blacklight stack; involved in the Hydra community
  • Digitization and scanning experience
  • Novice database administration experience (MySQL, SQL Server)
  • Comfortable with software testing in support of rapid deployment and agile/iterative design methods as well as management software (JIRA, Redmine)
  • Experience with many Linux flavors, comfortable with virtual machines and environments (VirtualBox, Vagrant, Docker, the upcoming Otto)