Leonardo Mannini
AI/ML Engineer | LLM & RAG Systems | Edge AI / TinyML
AI Engineer specializing in production LLM pipelines, RAG systems, agent-based workflows, backend development, and Edge AI for resource-constrained environments. I build end-to-end AI systems with LangChain, LangGraph, FastAPI, vector search, Docker-based services, and local or hosted LLMs. My research work includes TinyML models for real-time bioacoustic classification on microcontrollers, contributing to a paper accepted at IEEE ICASSP 2026.
Publications
Work Experience
AI Engineer
agosto 2023 ⇾ Present
Designing and deploying LangChain, RAG, LLM, FastAPI, and vector-search systems for business automation and intelligent workflows.
Highlights
- Built agent-based RAG pipelines using LangChain, FastAPI, and FAISS, indexing 1,200+ documents with 87% citation accuracy.
- Reduced response latency from 9.4s to 4.1s through retrieval optimization and caching strategies.
- Developed multi-step AI agents integrating LLM reasoning, external tools, structured outputs, and tool orchestration.
- Designed Docker-based backend services supporting scalable asynchronous workflows and reproducible AI deployments.
- Delivered end-to-end AI systems from model design to deployment.
Embedded AI Researcher (TinyML & Edge AI)
marzo 2025 ⇾ febbraio 2026
Developed TinyML models for real-time audio classification on resource-constrained embedded devices.
Highlights
- Designed lightweight CNN-GRU-Attention architectures for 70-species classification on microcontrollers.
- Processed 150k+ audio files for model training and benchmarking.
- Optimized inference pipelines using TensorFlow Lite Micro and CMSIS-NN.
- Achieved about 16x lower energy consumption compared to standard baseline systems.
- Contributed to research accepted at IEEE ICASSP 2026.
Software Engineer
ottobre 2020 ⇾ aprile 2023
Developed backend applications supporting logistics automation and operational workflows.
Highlights
- Built REST APIs and data pipelines using Python, C++, and Java.
- Automated batch-processing workflows using scheduled tasks and database integration.
- Designed reporting systems reducing manual processing and improving workflow reliability.
- Maintained production systems handling operational data flows.
Education
settembre 2023 ⇾ marzo 2026
Courses
- Thesis: Enabling Multi-Species Bird Classification on Low-Power Bioacoustic Loggers
Computer and Automation Engineering
Polytechnic University of Marche
Bachelor's degree
Score: 26/30 GPA
settembre 2018 ⇾ luglio 2023
Courses
- Thesis: Project of a virtual synthesizer in C++ on the JUCE platform
- Core focus on programming, electronics, automation, and control systems
Musical Instruments Engineering
Polytechnic University of Marche
Professional development programme
Score: 30/30 cum laude
marzo 2022 ⇾ settembre 2022
Courses
- Focus on digital signal processing and audio applications
- Award for best project of the programme
Piano
Conservatory G. Rossini of Pesaro
Conservatory diploma
Score: 10/10
gennaio 2013 ⇾ dicembre 2019
Teaching qualification pathway
Universita degli Studi eCampus
24 CFU
gennaio 2020 ⇾ dicembre 2020
Courses
- Anthropological, psycho-pedagogical, and teaching-methodology training
Awards
Award for best project of the programme
Polytechnic University of Marche, Musical Instruments Engineering
settembre 2022
Winner of the Italian Rapid U16 Chess Championships
giugno 2015
Certificates
Technical Support Fundamentals
Skills
Programming: Python, C++, Java & Bash
LLM & Agentic Systems: LangChain, LangGraph, Retrieval-Augmented Generation (RAG), OpenAI API, Local LLMs, Prompt engineering, Tool-based reasoning & Structured outputs
Backend & APIs: FastAPI, REST APIs, WebSockets, PostgreSQL, SQLAlchemy & Redis
Machine Learning & Edge AI: PyTorch, TensorFlow, PyTorch Lightning, NumPy, Scikit-learn, TensorFlow Lite Micro, CMSIS-NN & TinyML
Vector Search: FAISS & Chroma
Knowledge Engineering: Ontology generation, Knowledge extraction, Knowledge representation, Semantic Web fundamentals & Structured output evaluation
MLOps & Infrastructure: Docker, Terraform, Hydra, TensorBoard & Linux
Audio & Music Technology: JUCE, DSP, Real-time audio processing, MIDI programming & Music production
Languages
Italian: Native
English: C1 / Full professional proficiency (TOEFL iBT 105/120)
Projects
Camadon.ai - Mobile-Native AI Coding Environment
Mobile-native AI coding environment for iOS with terminal execution and AI-assisted workflows.
Highlights
- Developed a mobile-native AI coding environment for iOS with terminal execution and AI-assisted workflows.
- Built secure backend infrastructure with FastAPI, encrypted WebSocket relay, Redis, JWT authentication, Docker, and Terraform.
- Grew the product to 150+ beta users in 3 months, achieving 32% weekly retention and a 4.7/5 rating.
Bird Audio Classification on Edge Devices - TinyML Pipeline
TinyML pipeline for low-power bioacoustic classification, connected to research accepted at IEEE ICASSP 2026.
Highlights
- Developed an end-to-end TinyML pipeline trained on 150k+ audio samples across 70 species.
- Designed lightweight CNN-GRU architectures optimized for microcontroller deployment, reaching up to 90.8% accuracy on distinctive species.
- Deployed real-time inference on AudioMoth firmware using TensorFlow Lite Micro and CMSIS-NN, achieving about 16x lower energy consumption than baseline systems.
Grounded - Full-Stack LLM Research Assistant
Research assistant using LangGraph agents for grounded question answering across large academic corpora.
Highlights
- Built a full-stack research assistant using LangGraph agents for grounded QA across large academic corpora.
- Designed FastAPI retrieval pipelines integrating OCR ingestion, vector search, and persistent PostgreSQL storage.
- Improved citation accuracy to 87% and reduced latency from 9.4s to 4.1s through retrieval and caching optimizations.
GoldenRadio - ATS25 Firmware Enhancement and Licensing System
Enhanced firmware and production licensing system for ATS25 radio devices.
Highlights
- Developed enhanced firmware for ATS25 radio devices, adding new features that gained widespread community adoption.
- Designed a production-grade licensing system for secure firmware activation and controlled distribution.
- Built backend workflows for license validation and device authorization, supporting viral adoption and 100+ device sales in the first month.
Bench4KE Extension - Ontology Generation Benchmark Framework
API-driven benchmark framework for ontology generation and knowledge-engineering systems.
Highlights
- Extended an API-driven benchmark for LLM-based ontology generation and knowledge-engineering systems.
- Designed FastAPI pipelines executing external generators through standardized REST interfaces.
- Built reproducible evaluation workflows combining structural metrics, automated validation, and controlled generation assessment.
Virtual Synthesizer
Bachelor's thesis project: virtual synthesizer in C++ using the JUCE framework.
settembre 2022 ⇾ luglio 2023
Highlights
- Implemented real-time audio DSP algorithms, modular synthesis components, and MIDI control.