Bachelor Thesis

Automated knowledge graph creation via multi-object detection for an AI maintenance assistant

Overview

This research project addresses the gap between visual object detection and structured knowledge representation. I developed a system that automatically converts bounding box data from images of industrial machines into formal Knowledge Graphs (RDF/OWL). These graphs serve as a structured knowledge base for Large Language Models (LLMs), enabling them to answer spatial and maintenance-related questions in an industrial context without hallucinations.

Tech Stack

Pythonowlready2RDF / OWLDeepSeek / Llama / QwenSSD Object DetectionGit

The Challenge

Creating Knowledge Graphs manually is time-intensive and error-prone. While text-to-graph methods exist, there is a significant lack of automated approaches that transform visual object detection data into formal, logic-based knowledge structures that LLMs can process efficiently for industrial maintenance tasks.

The Solution

I designed and implemented a modular Python based pipeline (`OntologyGenerator.py`) that ingests CSV output from Multi-Object Detection models. Using geometric algorithms, it automatically derives semantic spatial relations (e.g., 'left_of', 'above', 'inside_of') and instantiates them into an OWL ontology. The system supports multi-camera fusion to minimize the uncertainty and problems of object detection.

System Architecture

The system processes annotated image data (CSV) containing bounding boxes. It calculates the center of each object and applies threshold-based logic to determine spatial relationships. It utilizes `owlready2` to generate standard-compliant RDF triples. The output was evaluated against four LLMs (DeepSeek-R1, DeepSeek-V3, Llama 3.1, Qwen 2.5) using specific metrics for correctness and completeness.

Key Features

Geometric Relation Extraction

Algorithms to automatically determine 'above', 'below', 'left_to', and 'inside_of' relations based on pixel coordinates.

Multi-Camera Fusion

Logic to merge object detection data from multiple angles into a single consistent Knowledge Graph.

Implicit vs. Explicit Modeling

Implemented variants to test graph compactness versus semantic completeness for LLM processing.

LLM Integration

Evaluated different serialization formats (OWL vs. Triples) to optimize LLM reasoning capabilities.

Project Gallery

Development Timeline

May 2025

Research & Concept

Analyzing state of the art in Semantic Web and Object Detection.

Jun 2025

Implementation

Developing the Python generator and relation logic algorithms.

Sep 2025

Submission

Final evaluation of LLM performance and thesis submission at TU Dresden.

Oct 2025

Defense

Final defense of thesis at TU Dresden. Received a grade of 1.4