Giovanni Pinna

Giovanni Pinna

AI Researcher & Engineer | Ph.D. in Applied Data Science & AI
University of Trieste

Welcome!

I am Giovanni Pinna, an AI Researcher and Engineer based in Trieste, Italy. I hold a Ph.D. in Applied Data Science & Artificial Intelligence from the University of Trieste (March 2026).

My research focuses on the intersection of Natural Language Processing, Large Language Models, and Evolutionary Computation — particularly on improving LLM-generated code through Genetic Improvement techniques and developing evaluation metrics for Text-to-SQL systems.

I have international research experience at University College London (UCL) in London, UK and NOVA IMS in Lisbon, Portugal. I am the author of 10+ publications in top international venues including Scientific Reports (Nature), IEEE Access, and EuroGP.

Interests

  • ๐Ÿ’ฌ NLP & Large Language Models
  • ๐Ÿ—„๏ธ Text-to-SQL
  • ๐Ÿงฌ Genetic Improvement
  • ๐Ÿค– AI Coding Agents
  • ๐Ÿ” RAG Systems

Education

Experience

  • ๐Ÿ›๏ธ
    Applied AI Scientist
  • ๐Ÿ‡ฌ๐Ÿ‡ง
    Visiting Researcher
    UCL โ€” CREST Centre, Sep โ€“ Dec 2025
  • ๐Ÿ‡ต๐Ÿ‡น
    Visiting Researcher
    NOVA IMS โ€” Lisbon, 2024 & 2025

๐Ÿ—ž๏ธ News

  • Apr 2026 Published two papers “Comparing ai coding agents: A task-stratified analysis of pull request acceptance” and “Analyzing Message-Code Inconsistency in AI Coding Agent-Authored Pull Requests” at MSR 2026 .
  • Mar 2026 Completed my Ph.D. in Applied Data Science & AI at the University of Trieste!
  • Sep 2025 Started visiting research at University College London (UCL), in Prof. Federica Sarro's group.
  • 2025 Published “Redefining Text-to-SQL Metrics” in Scientific Reports (Nature) and 2 papers at SSBSE 2025 (GA4GC and HotCat).

๐ŸŽ“ Ph.D. Thesis

Thesis Cover
Thesis Spine
Ph.D. Thesis โ€” University of Trieste, 2025
Large Language Models promise to reshape how we write code, query data, and access knowledge — but do they really deliver? This thesis probes that question across four fronts: how we evaluate LLMs, how they democratize expertise, how we make them reliable, and how we deploy them sustainably. A human-centered study on legal texts shows that domain experts still outperform state-of-the-art models precisely where the stakes are highest. To close the reliability gap, I introduce a Genetic Improvement framework that systematically repairs LLM-generated code, and a continuous Text-to-SQL metric that uncovers distinctions hidden by pass-or-fail scores. On sustainability, multi-objective optimization discovers coding-agent configurations with over a hundredfold hypervolume improvement. The result: a path to LLM deployment that is not only powerful, but trustworthy. Supervisors: Prof. Luca Manzoni, Prof. Andrea De Lorenzo.

๐Ÿ“„ Selected Publications

  • Redefining Text-to-SQL Metrics by Incorporating Semantic and Structural Similarity
    G. Pinna, Y. Perezhohin, L. Manzoni, M. Castelli, A. De Lorenzo
    Scientific Reports 15.1 (Nature), 2025
  • Comparing AI Coding Agents: A Task-Stratified Analysis of Pull Request Acceptance
    G. Pinna, J. Gong, D. Williams, F. Sarro
    arXiv:2602.08915, 2026
  • Analyzing Message-Code Inconsistency in AI Coding Agent-Authored Pull Requests
    J. Gong, G. Pinna, Y. Bian, J. M. Zhang
    arXiv:2601.04886, 2026
    ๐Ÿ† Distinguished Mining Challenge Paper Award, MSR 2026
  • Enhancing Large Language Models-Based Code Generation by Leveraging Genetic Improvement
    G. Pinna, D. Ravalico, L. Rovito, L. Manzoni, A. De Lorenzo
    EuroGP 2024, Springer LNCS vol. 14631
  • An Artificial Intelligence System for Automatic Recognition of Punches in Fourteenth-Century Panel Painting
    M. Zullich, V. Macovaz, G. Pinna, F.A. Pellegrino
    IEEE Access, 2023

๐Ÿ“ Recent Posts

๐Ÿš€ Projects

๐Ÿท๏ธ Popular Topics

๐Ÿ› ๏ธ Skills

Languages

  • Python
  • SQL
  • Java
  • C++

ML & NLP

  • PyTorch
  • HuggingFace
  • scikit-learn
  • spaCy / NLTK
  • BERTopic

LLM & Agents

  • LangChain
  • LlamaIndex
  • LangGraph
  • RAG Pipelines
  • Prompt Engineering

Tools

  • Git / GitHub
  • Docker
  • Linux
  • LaTeX
  • Streamlit / Gradio