I build data-centric, explainable, and interactive AI systems, from problem framing and data pipelines to model evaluation, dashboards, APIs, and decision-support tools.
My work focuses on projects where machine learning is not only trained, but also tested, explained, documented, and connected to a real user or business decision.
Main focus areas:
- Retrieval-Augmented Generation and graph-based AI systems
- Risk modeling, fraud detection, underwriting, and decision safety
- NLP systems with responsible evaluation and uncertainty handling
- Synthetic data generation and realism evaluation
- SQL + Python machine learning workflows
- Streamlit/FastAPI apps for interactive model use
Open to: Data Scientist, Machine Learning Engineer, and Applied AI roles, especially projects involving explainability, decision support, and production-minded ML workflows.
Projects that move beyond notebooks into repeatable workflows:
- data cleaning and validation
- feature engineering
- train/test evaluation
- cross-validation and model comparison
- uncertainty and threshold handling
- saved artifacts and reproducible outputs
- tests and CI where appropriate
I like building model interfaces that a user can actually interact with:
- Streamlit dashboards
- FastAPI backends
- CLI tools
- batch scoring workflows
- visual reports
- decision-support outputs
I try to make model behavior understandable through:
- honest limitations
- model cards and documentation
- leakage analysis
- SHAP and feature importance
- calibration and abstention
- uncertainty bands
- human-review workflows
- Graph-RAG-Engine | Graph intelligence, vector search, and explainable retrieval-augmented generation.
- PR-Guardian-AI | AI-assisted pull request review workflow using GitHub App patterns and OpenAI integration.
- RAG-vs-Fine-Tuning | Practical framework for deciding between retrieval and fine-tuning.
- Designing-Hybrid-AI-Systems | Notes and patterns for hybrid AI design.
- Financial-Fraud-Risk-Engine | Fraud-risk workflow with cost-sensitive thresholding and SHAP explanations.
- Underwriting-Decision-Safety-Lab | Calibration, abstention, and defensible loan decision policies.
- Onchain-Security-Suite | Web3 security pipeline with static analysis and deployer risk scoring.
- Synthetic-Data-Artist | Copula vs VAE synthetic tabular data comparison and diagnostics.
- Autocurator-Synthetic-Data-Benchmark | Synthetic data benchmarking across fidelity, coverage, privacy, and utility.
- Missing-Data-Doctor | Missingness profiling and imputation impact analysis.
- Coffee-Shop-Profit-Predictor | SQL + ML workflow for retail location profitability prediction.
- Forecast-Factory | Forecasting and scenario simulation app for business decision support.
- Data-Storytelling-Dashboard | E-commerce analytics dashboard with KPIs, cohorts, retention, and business narrative.
- Market-IQ | BI-style analytics and KPI exploration.
- Fake-News-Detector | Responsible text classification with uncertainty handling and leakage analysis.
- Sentiment-Analysis-BERT | BERT fine-tuning pipeline for sentiment classification.
- ML-Playground-Autodetect | Interactive ML playground with automatic task detection.
- Building more production-style ML projects with tests, CI, and reproducible workflows.
- Improving RAG systems with better retrieval evaluation, traceability, and source grounding.
- Expanding risk and decision-safety projects with calibration, abstention, and monitoring ideas.
- Turning analytics projects into clearer decision-support tools with stronger documentation and outputs.
- Email: amirhosseinhonardoust@gmail.com
- LinkedIn: honardoust
- X: @amirhonardoust
AI is not just about models; it is about systems that solve real problems for real people.






