Research Software Engineer – Project: From Books to Knowledge Graphs

Research Software Engineer – Project: From Books to Knowledge Graphs

Published Deadline Location
28 Jan 28 Feb Amsterdam

You cannot apply for this job anymore (deadline was 28 Feb 2021).

Browse the current job offers or choose an item in the top navigation above.

Job description

The Institute for Logic, Language and Computation (ILLC) and the Department of Media Studies of the Faculty of Humanities are looking for a research software engineer with a background in machine learning, knowledge engineering and/or digital humanities, to join the industry collaboration (KIEM 2020) project From Books to Knowledge Graphs.

This project is a collaboration among UvA (Dr. Giovanni Colavizza), the University of Lausanne, Switzerland (Dr. Matteo Romanello) and the publisher Brill, which aims to develop an open framework to extract knowledge graphs from scholarly publications using Brill’s classics catalog as a case study.

Project From Books to Knowledge Graphs

The scientific publishing industry is rapidly transitioning towards information analytics. This shift is disproportionately benefiting large companies. These can afford to deploy digital technologies like knowledge graphs that can index their contents and create advanced search engines. Small and medium publishing enterprises, instead, often lack the resources to fully embrace such digital transformations. This divide is acutely felt in the arts, humanities and social sciences. Scholars from these disciplines are largely unable to benefit from modern scientific search engines, because their publishing ecosystem is made of many specialized businesses which cannot, individually, develop comparable services.

In this project, we aim to start bridging this gap by democratizing access to knowledge graphs – the technology underpinning modern scientific search engines – for small and medium publishers in the arts, humanities and social sciences.​ Their contents, largely made of books, already contain rich, structured information – such as references and indexes – which can be automatically mined and interlinked. We plan to develop an open-source framework for extracting structured information and create knowledge graphs from it.​ The framework will be released with a commercial-friendly license to encourage its re-use. We will as much as possible consolidate existing proven technologies into a single codebase, instead of reinventing the wheel.

What are you going to do?

As  research software engineer you will:

  • help designing an information extraction pipeline which extracts structured entity information from scholarly publications (e.g., references and indexes) and performs entity linkage, in close collaboration with industry partners (Brill);
  • develop such pipeline, by re-using existing solutions or developing novel ones;
  • test and apply the pipeline on Brill’s classics publication catalog;
  • document work and code via comments, tests and technical documentation;
  • publish code and data openly and support early adopters in re-using them;
  • assist and take an active role in the execution of other project-related activities (e.g., outreach).

Specifications

University of Amsterdam (UvA)

Requirements

What do we require?

The following is required:

  • a degree in a related field, including but not limited to digital humanities, computer science, machine learning, knowledge engineering. Alternatively, a humanities (or other) degree with proven experience in research software engineering;
  • excellent written and spoken English;
  • proved capacity to design, develop and document research software. While the project members mostly use the Python stack, other options are also possible.

The following is preffered:

  • previous experience with machine learning, and in particular information extraction tasks (Named Entity Recognition, entity linkage, etc.;
  • previous experience with knowledge engineering;
  • a PhD in a related field (Computer Science, Machine Learning, Knowledge Engineering, Digital Humanities see above);
  • an interest in the SME publishing sector;
  • an interest in Classics and/or digital humanities.

Conditions of employment

Our offer

You will be appointed as Research Software Engineer, ufo-profile ICT developer, for 19 hours per week (0.5 FTE) for a period of 12 months at the Media Studies Department of the Faculty of Humanities. Remote work is the default, given the current situation. The starting date of the contract is 1 April 2021. The gross monthly salary (on full-time basis) will range from €2,790 to €4,012 gross per month, depending on experience and qualifications, in accordance with the Collective Labour Agreement of Dutch Universities. This is exclusive 8% holiday allowance and 8.3% end-of-year bonus.

Employer

University of Amsterdam

With over 5,000 employees, 30,000 students and a budget of more than 600 million euros, the University of Amsterdam (UvA) is an intellectual hub within the Netherlands. Teaching and research at the UvA are conducted within seven faculties: Humanities, Social and Behavioural Sciences, Economics and Business, Law, Science, Medicine and Dentistry. Housed on four city campuses in or near the heart of Amsterdam, where disciplines come together and interact, the faculties have close links with thousands of researchers and hundreds of institutions at home and abroad.  

The UvA’s students and employees are independent thinkers, competent rebels who dare to question dogmas and aren’t satisfied with easy answers and standard solutions. To work at the UvA is to work in an independent, creative, innovative and international climate characterised by an open atmosphere and a genuine engagement with the city of Amsterdam and society.

Specifications

  • IT
  • Natural sciences; Language and culture
  • max. 19 hours per week
  • €2790—€4012 per month
  • Doctorate
  • 21-052

Employer

University of Amsterdam (UvA)

Learn more about this employer

Location

Kloveniersburgwal 48, 1012 CX, Amsterdam

View on Google Maps

Interessant voor jou