[ok] loaded alex.vesa <senior_ai_engineer>
[ok] mounted ~/portfolio · 10y of uptime
[··] awaiting your_brief.md
Building AI systems
that earn their keep.
Ten years architecting, shipping and maintaining machine‑learning products across automotive, medtech and consumer services. I work with teams that need an outside operator — not a slide deck.
The product I sell is the boring discipline that makes the interesting work possible — retraining loops, eval infrastructure, observability, on‑call.
- 01.
Trends are noise. Best practices are evidence. I pay attention to the second.
- 02.
An AI system is a contract with reality, not a demo. If it can't be observed, retrained and rolled back, it doesn't ship.
- 03.
MLOps is the boring discipline that makes the interesting work possible.
- 04.
Strategy that doesn't survive contact with a Dockerfile isn't strategy.
Production AI
Production AI
A field manual for shipping machine learning into systems that have to keep working.
Eighteen chapters on the parts of ML that don't fit in a notebook — retraining, eval, drift, on‑call, cost, and the politics of model rollback. Drafts ship to subscribers monthly.
The Boring Discipline
The Boring Discipline
Twenty‑two essays on MLOps as a craft.
A self‑published collection of long‑form pieces written between 2021 and 2024, gathered and rewritten. Read by ~14,000 engineers and three regulators.
| date | venue | title | format |
|---|---|---|---|
| Jun 2026 | PyData Bucharest Bucharest | Eval‑first development for LLM systems | Keynote |
| May 2026 | MLOps World Berlin | When to throw the model away | Talk |
| Mar 2026 | Stanford MedAI Seminar Remote | Segmenting organs at risk: a postmortem | Lecture |
| Nov 2025 | NeurIPS Workshop Vancouver | The retraining loop is the product | Paper + talk |
| Sep 2025 | AI Engineer Summit San Francisco | Production patterns for retrieval | Workshop |
| Jun 2025 | Devoxx Romania Bucharest | MLOps for backend engineers | Talk |
| Mar 2025 | EuroPython Prague | Three years of PyTorch in production | Talk |
AI Systems Architecture
From greenfield to legacy. I design the model, data and infrastructure layers as one system — not three separate procurement decisions.
- Architecture review & technical due diligence
- Model + data + serving topology
- Cost, latency and observability budgets
- Hiring plan for the team that will run it
MLOps Engagement
I embed with your team and build the boring scaffolding — eval, retraining, monitoring, on‑call — that makes the interesting work possible.
- CI/CD for models, data and prompts
- Eval infrastructure & regression gates
- Drift, cost and quality dashboards
- Runbooks and on‑call rotation design
Executive Advisory
For founders, CTOs and product leaders who need a working AI engineer in the room when the decisions are made. Quietly, on a schedule.
- Monthly working sessions
- Roadmap and hiring review
- Vendor and architecture sanity checks
- On‑call for 'should we?' decisions
The right time to call me is before the proof of concept is dressed up as the product.
Workspace OS for clinical data
An AI workspace that turns scanned referral chaos into structured, queryable patient data. LLM extraction + OCR + a stubborn schema. Deployed across 14 hospital networks.
Organs‑at‑risk segmentation
2.5D and 3D CNN segmentation pipeline used in radiation therapy planning. Sub‑millimetre tolerance, traceable retraining, and a paper trail that survives an FDA conversation.
Night‑vision driver assist
Real‑time pedestrian and animal detection on infrared imagery, running on automotive‑grade silicon. Three OEM integrations, one of which still ships today.
Social media post generator
Early LLM product, built on first‑gen GPT‑3. Brand voice fine‑tuning, scheduled generation, human approval queue. A useful lesson in what LLM products were actually for.
Languages
- Python
- Go
- Node.js
- SQL
ML / DL
- PyTorch
- TensorFlow
- Keras
- MONAI
- OpenCV
- CUDA
Data & infra
- Postgres
- Elasticsearch
- Redis
- Kafka
- Qdrant
- Chroma
Cloud & ops
- AWS
- Docker
- Terraform
- CDK
- MLflow
- W&B
- Langfuse
Frameworks
- FastAPI
- Django
- Flask
Write to me plainly.
I read everything.
A short note about what you're building and where it's stuck is the fastest way to start. I usually reply within 48 hours.
alex.vesa@cube-digital.io