Knowledge Graph Engineering
Welcome to the homepage of the fall 2023 edition of Knowledge Graph Engineering course of the Computer Science degree at the University of Trento.

 

 

News


For the students who intend to do the projects presented in class (Lecture #8), as well as to access the final exam, it is MANDATORY to fill the following form by specifying the preferences about the availbale projects Projects Proposals Form. DEADLINE 7/10

October 3rd, 2023

 

 

For the students who intend to follow the course, as well as to access the final exam, it is MANDATORY to fill the following Participant Information Form.

September 11th, 2023

 

 

Register on the Moodle page to receive the lecturers communications. You can find the course on moodle following the path:
Courses/ 2023/2024 / Dipartimento di Biologia Cellulare, Computazionale e Integrata - Department of Cellular, Computational and Integrative Biology - CIBIO / Knowledge Graph Engineering [146114] - GIUNCHIGLIA

September 11th, 2023

 

 

This class will start on Monday Sept 11th. The Calendar section of this website is still under review.

August 3rd, 2023

 

 

 

 

Last modification: September 11th, 2023

Instructions


The 2023 edition of KGE is taught in presence. As for the last years, presence, even if not a formal requirement, is strongly suggested given that this is a hands-on lab course. Passing the exam amounts to developing a project, which ultimately will lead to the generation of a Knowledge Graph (and support documentation) starting from data provided by the lecturers. This course and project work will develop under the continuous supervision of the lecturers, in collaboration with a colleague. There are no easy or cost-effective ways to develop the project without a continuous presence in class.

The lectures will take place following the scheduling indicated in the section Calendar and Material. The course material includes slides, demo videos, support resources and links, all provided on the web sit. After each main phase of the project, there will be a Q&A lecture during which the students can ask questions about all their open problems and doubts.

At the end of the course students will be asked to fill an online questionnaire about the overall process and methodology they will have learned. This feedback is very important to us, as it is the basis for a continuous evolution and improvement of the course and methodology being taught. To this extent students are strongly encouraged to raise doubts, ask questions, discuss the doubts they have about the methodology itself during the Q&A lectures.

Syllabus


Course Objectives and Outcomes

The Knowledge Graph Engineering (KGE) course aims to teach what are Knowledge Graphs, which are their possible usages and features, and what it means to build a KG. During the curse the students will discover the different issues to be addressed when a KG is built, learning which are the activities to be executed to that end, as per the current state of the art on Knowledge Graph Engineering. The course will teach an innovative methodology for Knowledge Graph Engineering, called iTelos. The methodology, will be applied by the students over real-world use case. By applying iTelos, the students will learn how to execute together the several activities involved in the construction of Knowledge Graphs. Moreover, such a process will be supported, within the course, by the usage of specific tools and libraries that the students will learn in order to solve the issues encouterd.

 

This course is taught in English. The intended target are the students of the master degree in Computer Science of the Department of Computer Science and Information Engineering (DISI) of the University of Trento. This is a 14 week, 48 hours, six credit, advanced course on how to develop a Knowledge Graph (KG) starting from data – to be cleaned and adapted – which are already available. This is a hands-on course. After a few introductory classes, students are given a problem to solve and they will build a knowledge graph solving this problem. A limited set of teaching material is available, mainly in form of slides. The student will learn mainly by doing the actual work and by interacting with the teacher and tutors. The exam consists in: writing a project report, giving a demo and making a public presentation.

 

General Description

This course will cover the following topics:
  • What are Knowledge Graphs (KGs)
  • What Knowledge Graphs can be used for, and example of already used KGs
  • What does it means to build a KG
  • How to reduce the cost of data use and reuse, exploiting KG
  • How to solve the different problems involved in KG construction, using the iTelos KGE methodology
  • How to use new and existing tools and libraries to address the problems encounterd in KGs construction
  • How to develop an entire project of KGE on real-world case studies

 

Prerequisites

  • Data management: basic programming skills in python and/or java/javascript
  • Databases modeling: ER modeling, (Ontology modeling if possible, Ontology definition desirable)
  • Web languages (mainly RDF and OWL)
  • Attitude to teamwork

 

Course modality

Theory:
  • Lectures about KGE and the iTelos methodology, which aims to be the approach to be used to achieve the course objectives. iTelos is divided into 4 main phases that will shape the theoretical body of the course.
Practie:
  • During the whole course the students (grouped in teams) will be asked to carry on a KGE project assigned by tutors, relative to real-world case studies. The student will apply the iTelos methodology to produce a KG suitable to satisfy the purpose of the projects assigned.
Modality:
  • Theory and practice will go on in parallel. The lectures will describe the problems and the solutions, proposed by the iTelos methodology, that will be then immediately applied in practice over the projects assigned.
  • The course requires the student's presence in the classroom for the theoretical lectures (difficult to be learnt by only reading the slides provided lecture by lecture). Moreover, a strong cooperation between the team members is required to carry on the project's development along the course.

Teachers


Fausto Giunchiglia
Simone Bocca
Mayukh Bagchi
Xiaoyue Li
Matteo Busso
Fausto Giunchiglia
Simone Bocca
Mayukh Bagchi
Xiaoyue Li
Matteo Busso
fausto.giunchiglia@unitn.it
simone.bocca@unitn.it
mayukh.bagchi@unitn.it
xiaoyue.li@unitn.it
matteo.busso@unitn.it

Calendar and Material


The course runs from Sep, 11, 2023 till Dec XX, 2023 with the following schedule

     

  • Monday, 17:30-19:30, Room A209

  •  

  • Wednesday, 9:00-11:30, Room A224

 

You might want to read the Instructions to understand how to take the course.

 

Notice also the titles and structure of the lessons yet to be delivered might change slightly. The rule of the thumb is: if there are links with materials, things won’t change; if there are no links to the materials, titles and content are just suggestions.

 

Lesson Number Date                                  Time Material                              Content of Material Lecturer(s)                 External resources                         Phase documentation deadline                        
1 Mon 11 Sep, 2023 17:30 Slides Course Organization F. Giunchiglia KGE projects catalog
2 Wed 13 Sep, 2023 9:00 Slides The data reuse problem F. Giunchiglia
3 Mon 18 Sep, 2023 17:30 CANCELED F. Giunchiglia
4 Wed 20 Sep, 2023 9:00 Slides State of the art S. Bocca
5 Mon 25 Sep, 2023 17:30 Slides The solution - iTelos F. Giunchiglia
6 Wed 27 Sep, 2023 9:00 Slides The solution - iTelos F. Giunchiglia Stream data
7 Mon 2 Oct, 2023 17:30 CANCELED F. Giunchiglia
8 Wed 4 Oct, 2023 9:00 Projects
Slides
Projects proposals
iTelos Methodology
S. Bocca OSM Resources
9 Mon 9 Oct, 2023 17:30 Slides Phase 1 - purpose definiton F. Giunchiglia
10 Wed 11 Oct, 2023 9:00 Q&A - phase 1 S. Bocca
11 Mon 16 Oct, 2023 17:30 Slides Phase 2 - Information Gathering F. Giunchiglia RDF-OWL Project report - Phase 1
12 Wed 18 Oct, 2023
Room A201
9:00 Video - Data Management Phase 2 - Information Gathering - Practice F. Giunchiglia Karma
Protege
Protègè guidelines
13 Mon 23 Oct, 2023 17:30 Q&A - phase 2 F. Giunchiglia
14 Wed 25 Oct, 2023 9:00 Slides Phase 3 - Language Definition F. Giunchiglia Project report - Phase 2
15 Mon 30 Oct, 2023 17:30 Language Spreadsheet Template Phase 3 - Language Definition - Practice F. Giunchiglia Language Definition tool instructions
16 Mon 6 Nov, 2023 17:30 Q&A - phase 3 F. Giunchiglia
17 Wed 8 Nov, 2023 9:00 Slides Phase 4 - Knowledge Definition F. Giunchiglia Project report - Phase 3
18 Mon 13 Nov, 2023 17:30 Slides Phase 4 - Knowledge Definition - Practice F. Giunchiglia
19 Wed 15 Nov, 2023 9:00 Q&A - phase 4 F. Giunchiglia
20 Mon 20 Nov, 2023 17:30 Slides Phase 5 - Data Definition - Theory & Practice F. Giunchiglia Project report - Phase 4
21 Wed 22 Nov, 2023 9:00 Q&A - phase 5 F. Giunchiglia
22 Mon 27 Nov, 2023 17:30 Slides KG's Evaluation F. Giunchiglia
23 Wed 29 Nov, 2023 9:00 CANCELED F. Giunchiglia
24 Mon 4 Dec, 2023 17:30 Metadata
SPARQL
SPARQL-Demo
Metadata and
Query the Graph
F. Giunchiglia GraphDB
SPARQL-book
SPARQL-W3C
25 Wed 6 Dec, 2023 9:00 Q&A general F. Giunchiglia
26 Mon 11 Dec, 2023 17:30 Q&A general F. Giunchiglia Project report - Phase 5
27 Exams dates Registration sheet Registration sheet
28 Questionnaires Methodology
Tools
Evaluation Questionnaires

Exam


After the completion of each iTelos methodology phase (both concerning theory and practice) the students will have to provide an intermediate report of the work done so far, which will be checked and evaluated by lecturers. This intermediate evaluation will allow the lecturers to lead the teams towards the right direction by correcting possible errors during the methodology implementation.

The final exam will consist of a presentation of the KGE projects developed along the course and finalized achieving the output required by the initial project's purpose. In addition some questions can be asked, at the end of the project presentation, to evaluate the students over the key aspect of the iTelos methodology.

Collaboration Opportunities


Multiple positions are available as 150h and internships. They should be considered as the first part of a research project and thesis with the Knowdive group. The general activities of the group are listed on the website (http://knowdive.disi.unitn.it/), while activities already scheduled and available now can be found at http://knowdive.disi.unitn.it/work-with-us/. The 150h activities have variable length and are strictly related to software development: for this reason, knowledge of software development with at least onr programming language is a must. All the activities can also be carried on in a remote fashion.

 

Anyone interested in these opportunities can send an email to knowdive-positions@disi.unitn.it, providing already information about preferences in terms of topics or activities (if known). For 150h activities it is important to provide information about known programming languages with the corresponding level, a value in the range [1 - 5] where 1= basic knowledge, 5= advanced knowledge.

 

The applications to the “150 ore” program can be done at the link:
https://www.unitn.it/servizi/224/collaborazioni-studenti-150-ore
Notice that the deadline for applications for the A.Y 2023-2024 is September 30, 2023