5 years in NLP and Machine Learning, with deep expertise in Knowledge Graphs and their integration with Generative AI to address LLM challenges such as hallucination and compliance. Built enterprise-scale Knowledge Graphs (50K+ entities) and end-to-end data pipelines processing 100K+ documents, from schema design to production deployment. Experienced in RDF/OWL/SPARQL stack and property graphs (Neo4j), with 8 peer-reviewed publications in NLP and document analysis. Passionate about applying Knowledge Graphs as key differentiators for Business AI solutions.
Open to relocation
Work Permit: German PR (§18c)
08.2021 - 12.2024
Technical University of Darmstadt
BMBF-funded project InsightsNet, acting as the team's NLP engineer to build production ML systems for knowledge extraction from scientific publications.
04.2020 - 04.2021
Robert Bosch GmbH
Developed deep learning models for time-series forecasting in IoT sensor data (e-Bike sensors).
Programming & ML Stack: Python, C/C++, Java, PyTorch, HF Transformers, sklearn, NumPy, pandas, spaCy, Git, CI/CD, Docker
NLP & Document AI: NER, IE, entity/relation extraction, citation analysis, document layout analysis, PDF parsing, OCR post-processing, semantic/knowledge modeling, Generative AI (LLMs, vLLM, RAG), KG-augmented QA/search, hallucination/fact checking via entity linking
ML, Data & Knowledge Graphs: Transformer fine-tuning (BERT, Llama, Qwen), graph-based approaches & analytics, Academic Knowledge Graph architecture, RDF/OWL/SHACL/SPARQL, property graphs & graph DBs (Neo4j, ArangoDB), data/business schema modeling, annotation pipeline design, evaluation & error analysis, efficient training/inference, reproducible ML
2021 - 2026
Technical University of Darmstadt
Dissertation: Enhancing Scholarly Document Accessibility and Analysis
2018 - 2021
University of Stuttgart
Thesis: Semi-supervised Event-centered Emotion Analysis and Performance Prediction (in collaboration with Robert Bosch GmbH)
2014 - 2018
Henan Normal University
Poster, DTU MLOps Summer School 2022. Built entity-linking pipeline integrating Knowledge Graphs with LLMs to verify factuality and reduce hallucinations.
Open-source developer tool with 1K+ installs providing SPARQL IntelliSense, prefix completion and query authoring for knowledge graphs. [GitHub]
Languages: Chinese, English (C1), German (B1) · Awards: SIGIR JCDL 2025 Travel Grant; ACM-ICPC Bronze 2017