
Giovanni Pinna
Welcome!
I am Giovanni Pinna, an AI Researcher and Engineer based in Trieste, Italy. I hold a Ph.D. in Applied Data Science & Artificial Intelligence from the University of Trieste (March 2026).
My research focuses on the intersection of Natural Language Processing, Large Language Models, and Evolutionary Computation — particularly on improving LLM-generated code through Genetic Improvement techniques and developing evaluation metrics for Text-to-SQL systems.
I have international research experience at University College London (UCL) in London, UK and NOVA IMS in Lisbon, Portugal. I am the author of 10+ publications in top international venues including Scientific Reports (Nature), IEEE Access, and EuroGP.
Interests
- NLP & Large Language Models
- Text-to-SQL
- Genetic Improvement
- AI Coding Agents
- RAG Systems
Education
- Ph.D. Applied Data Science & AIUniversity of Trieste, 2023 โ 2025
- M.Sc. Computer Science EngineeringUniversity of Trieste, 2019 โ 2022
- B.Sc. Computer Science EngineeringUniversity of Trieste, 2015 โ 2019
Experience
- Applied AI ScientistPLUS S.r.l. / Area Science Park, 2023 โ 2025
- Visiting ResearcherUCL โ CREST Centre, Sep โ Dec 2025
- Visiting ResearcherNOVA IMS โ Lisbon, 2024 & 2025
๐๏ธ News
- Apr 2026 Published two papers “Comparing ai coding agents: A task-stratified analysis of pull request acceptance” and “Analyzing Message-Code Inconsistency in AI Coding Agent-Authored Pull Requests” at MSR 2026 .
- Mar 2026 Completed my Ph.D. in Applied Data Science & AI at the University of Trieste!
- Sep 2025 Started visiting research at University College London (UCL), in Prof. Federica Sarro's group.
- 2025 Published “Redefining Text-to-SQL Metrics” in Scientific Reports (Nature) and 2 papers at SSBSE 2025 (GA4GC and HotCat).
๐ Ph.D. Thesis


๐ Selected Publications
- Redefining Text-to-SQL Metrics by Incorporating Semantic and Structural SimilarityScientific Reports 15.1 (Nature), 2025
- Comparing AI Coding Agents: A Task-Stratified Analysis of Pull Request AcceptancearXiv:2602.08915, 2026
- Analyzing Message-Code Inconsistency in AI Coding Agent-Authored Pull RequestsarXiv:2601.04886, 2026๐ Distinguished Mining Challenge Paper Award, MSR 2026
- Enhancing Large Language Models-Based Code Generation by Leveraging Genetic ImprovementEuroGP 2024, Springer LNCS vol. 14631
- An Artificial Intelligence System for Automatic Recognition of Punches in Fourteenth-Century Panel PaintingIEEE Access, 2023
๐ Recent Posts
๐ Projects
Influence
A platform combining marketing and AI to analyze audiences and generate targeted social media content.
GI for LLM Code
Genetic Improvement pipeline to automatically correct and improve SQL code generated by Large Language Models.
Text-to-SQL Metric
Novel evaluation metric integrating semantic and structural similarity for Text-to-SQL systems. Published in Scientific Reports.
Historical Newspaper NLP
Multilingual NLP pipeline for comparative analysis of Italian and Slovenian historical newspapers from 1902 Trieste.
๐ท๏ธ Popular Topics
๐ ๏ธ Skills
Languages
- Python
- SQL
- Java
- C++
ML & NLP
- PyTorch
- HuggingFace
- scikit-learn
- spaCy / NLTK
- BERTopic
LLM & Agents
- LangChain
- LlamaIndex
- LangGraph
- RAG Pipelines
- Prompt Engineering
Tools
- Git / GitHub
- Docker
- Linux
- LaTeX
- Streamlit / Gradio