## Exposing Institutional Repositories as Linked Data
### _A case study_
### Vitali Peil, Christian Pietsch, Najko Jahn
![Universitätsbibliothek Bielefeld](img/logoverbund_ub.png)
#### Bielefeld University Library
#### SWIB 13
## What this talk is about
* Institutional Repositories
* PUB: An Overview
* Exposing Linked Data
* Perspectives
----
## Repositories - a historical remark
![silo](img/silo.jpg)
silos by Doc Searls http://www.flickr.com/photos/docsearls/5500714140/
----
## What is PUB?
### PUB is the central publication management system for Bielefeld University.
## The PUB System
**Open source, of course**
* Catmandu-Framework developed by the LibreCat group
* ElasticSearch
* NoSQL
* fast
* flexible
### Publication List Manager
![person list](img/personlist.png)
### Publication List Manager
![department list](img/citec.png)
* classical institutional repository
* theses
* OA fulltexts (pre- and postprints)
Research Data (coming soon)
![researchdata](img/researchdata.png)
Project Information (coming soon)
![project](img/project.png)
Science Awards (coming soon)
![award](img/award.png)
## Contextualization in PUB
![overview](img/overview.png)
## Challenges
* heterogeneous data
* different publication types
* different entities: publications, projects, awards, organizational data
* have to use many ontologies/vocabularies
## The PUB Ecosystem
### Data Enrichment
* links to subject repositories
* arXiv.org
* Inspire HEP
* Europe Pubmed Central
* EBI databases
* GenBank
![arxiv](img/arxiv_inspire.png)
![ebi](img/ebi.png)
![citations](img/citations.png)
* links to
* Web of Science
* published version via DOI
Provide Links to Europe Pubmed Central
![labslink](img/labslink2.png)
## The PUB Ecosystem
### Export
* provide export formats
* bibtex, yaml, ris, json, rtf, mods, dc, csv
* problems
* publication centered
* you lose a lot of data
### Embed
* personal publication lists
* author disambiguation
* connected to local administrative system
* department publication lists
* again connected to university's local systems
* via APIs
* SRU, OAI-PMH, and more
Using Disciplinary Infrastructures: _Citec Toolkit_
![tk2](img/toolkit2.png)
![toolkit](img/toolkit.png)
----
# Exposing Linked Data
## First steps
* introduced URIs
* content negotiation
* exposing data using schema.org as microdata
## Ontologies to be considered
* the basic ones
* DCTerms
* FOAF
* BIBO
* ... and the more advanced
* CiTO
* FaBiO
* MODS
* SKOS
* ORE
* VIVO
* Datacite
* CERIF
# Examples
Person
![person](img/person.png)
Publication
![publication](img/publication.png)
Identifiers
![pub2](img/identifier.png)
## Perspectives
* Long-term preservation
* Linked Data Hub @Bielefeld University
* collecting information from PUB, the Directory of Staff and Departments and other research facilities
* Tracking metadata over time
* we started tracking metadata with git
* Memento project
## Conclusion
* repositories are not dead
* just change their scope!
* we have a lot of interesting data!
* identify open access publications
# Thank You!
### Contact:
{vitali.peil, christian.pietsch, najko.jahn} at uni-bielefeld.de
http://pub.uni-bielefeld.de
### Thanks to
the PUB team and
Cord Wiljes (Semantic Computing Group, Bielefeld University).
This work is licensed under a Creative Commons Attribution 4.0 International License.