OEG-Clark/README.md

Hi there, I'm Clark (ZiYuan Wang) 👋

NLP Research Engineer | Knowledge Graphs | LLMs & Ontology Engineering

I develop AI systems that bridge language models with structured knowledge: building agentic LLM applications and benchmarking LLMs on a range of tasks.

📍 Currently @ Ontology Engineering Group, Universidad Politécnica de Madrid.

🔬 Research Focus

  • Domain-Adaptive AI Systems: Developing LLM applications that serve as trustworthy collaborators in knowledge-intensive fields, from solar chemistry to ontology engineering, with an emphasis on explainability and user-centered design.

  • LLM Evaluation & Benchmarking: Constructing evaluation frameworks that measure not only accuracy but also explanation quality, scientific integrity, and real-world usability across diverse tasks and domains.

  • Graph Learning Methodology: Exploring graph learning methods across diverse domains, with a particular focus on graph similarity search, node/link prediction, and adapting graph learning architectures for real-world problems such as software similarity.


🚀 Featured Projects

Solar-QA — LLM-based RAG Pipeline for Solar Chemistry

Question-answering system helping solar chemistry experts review experiments from 700+ academic papers.
Stack: Python | RAG | Information Extraction | LLMs

SOEL LLMs for Ontology Engineering

Mapping LLM capabilities to ontology engineering tasks, following the LOT methodology.
Focus: Text2Triples, Triples2Onto, Ontology Generation

SoftSim — GNN Software Similarity Dataset

Novel dataset of 6,000+ software repositories for training graph neural networks on code understanding tasks.
Stack: PyTorch | GNN | Graph Learning


🛠️ Tech Stack

Languages: Python | SQL | CUDA
ML/DL: PyTorch | TensorFlow | Scikit-Learn | Transformers
Technologies: Docker | Unix/Shell | RAG Systems | Knowledge Graphs
Specialties: NLP | LLMs | Graph Learning | Ontology Engineering


📫 Contact

Popular repositories

  1. inspect4py (Public)

    Forked from SoftwareUnderstanding/inspect4py

    Static code analysis package for Python repositories

    Python

  2. softsim (Public)

    Forked from SoftwareUnderstanding/softsim

    Repository to store all preparation and cleaning needed for software similarity

    Jupyter Notebook

  3. PODIO (Public)

    Forked from oeg-upm/PODIO

  4. Mint-ModelCatalog-Ontology (Public)

    Forked from mintproject/Mint-ModelCatalog-Ontology

    Model Catalog Ontology

    HTML

  5. solar-qa (Public)

    Forked from oeg-upm/solar-qa

    A pipeline to annotate solar chemistry experiments according to solarchem model

    Jupyter Notebook

  6. solar-qa-eval (Public)

    Forked from oeg-upm/solar-qa-eval

    Repository for the solar question answering evaluation

    Jupyter Notebook