BBWorksB (Manyara Baraka) · GitHub
Skip to content
View BBWorksB's full-sized avatar

Highlights

  • Pro

Block or report BBWorksB

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
BBWorksB/README.md


"The gap between a model that works in a notebook and a system that works in production is where most ML engineers stop. That gap is where I operate."


👤 About

I am a Machine Learning Engineer, AI System Designer, and Data Scientist — rigorous, systematic, and wired differently. I design and build systems that sit at the intersection of mathematical depth and real-world consequence: multi-LLM orchestration pipelines, computer vision systems, NLP architectures, cloud-native deployments, and end-to-end data engineering at production scale.

I don't build prototypes. I build systems that run on real infrastructure, process tens of thousands of records, and deliver measurable outcomes in environments where failure is not an option.

That foundation is shaped by an uncommon academic combination: a Master's in Engineering AI from Carnegie Mellon University Africa, built on an undergraduate degree in Applied Physics and Computer Science. Physics gave me the discipline to model complex systems from first principles. Computer Science gave me the tools to implement them. Carnegie Mellon gave me the rigour to engineer them at scale. Together, they make me the kind of engineer who does not just apply ML — I design the systems around it.

I have mentored 100+ students across AI/ML, Data Science, and Data Analytics programmes at Moringa School, translating deep technical complexity into structured, actionable understanding for learners at every level of expertise.


🏗️ Flagship Project

Multi-LLM Consensus Validation System

Production Deployment — Aviation Safety

Designed and deployed an AI system that autonomously validates aviation safety reports at production scale. The core challenge: no single model is reliable enough for safety-critical classification. The solution is a novel ensemble architecture that leverages self-consistency reasoning across five models, governed by a deterministic consensus engine — achieving automation at a level that removes human bottlenecks without sacrificing accuracy.

Architecture

  • 5-model ensemble: 2 fine-tuned BERT classifiers + 3 Gemini LLMs at distinct temperature settings (0.3 / 0.5 / 0.7)
  • 7-rule consensus engine routing each prediction based on inter-model agreement patterns
  • Asynchronous processing pipeline with rate limiting, retry logic, and full error recovery
  • Containerized with Docker, orchestrated with Kubernetes, production deployment
  • PostgreSQL backend · FastAPI serving layer · cron-scheduled batch processing

Results — 3,600+ validated reports

Metric Result
Automation rate 95.06%
LLM agreement rate 89.81%
Average processing time 2.15 seconds / report

Python FastAPI PostgreSQL Docker Kubernetes Gemini API SciPy NumPy

  BERT Classifier ─────┐
  BERT Classifier ─────┤
                        ▼
  Gemini (t=0.3) ──► 7-Rule
  Gemini (t=0.5) ──► Consensus
  Gemini (t=0.7) ──► Engine
                        │
                        ▼
               Auto-Validated
               Safety Reports
                 (95.06%)

🛠️ Projects

Wolframite Classification — Materials Science Research

Computational research combining machine learning with Quantum ESPRESSO simulations to predict mineral properties from first-principles physics data. Bridges computational physics and applied ML — manuscript in preparation.

Python Quantum ESPRESSO Scikit-learn Statistical Modeling

Automated Weed Detection — Precision Agriculture

Deep learning system for real-time crop-weed classification using CNNs and transfer learning. Designed to reduce chemical application through targeted identification, with direct application to sustainable farming at scale.

Python TensorFlow OpenCV NumPy

Feature Detection & Panorama Stitching

End-to-end computer vision pipeline implementing SIFT and ORB feature matching with RANSAC-based outlier rejection, and multi-band Laplacian/Gaussian pyramid blending for seamless panoramic reconstruction.

Python OpenCV NumPy SciPy Matplotlib

Swahili Audio Classification

Sequence-to-sequence RNN with attention mechanisms for multilingual audio transcription in Swahili — a low-resource African language. Demonstrates capability in building NLP systems beyond English-dominant benchmarks.

Python TensorFlow/Keras NLTK Librosa

Cloud-Based Data Management System

Distributed NoSQL architecture using Cassandra for real-time data storage and retrieval, containerized with Docker and orchestrated via Kubernetes on Azure with automated scaling and load balancing.

Python Cassandra Docker Kubernetes Azure

Agricultural Geospatial Analysis

Remote sensing and satellite imagery analysis for sustainable land management using Google Earth Engine. Applied spatial analysis techniques to support evidence-based environmental monitoring.

Python Google Earth Engine Spatial Analysis


🔭 Currently Working On

Project Description Status
Basic_to_ML A systematic series rebuilding ML and deep learning concepts from scratch — each concept first implemented at its simplest form, then extended into a full working system. Inspired by the 100-day challenge format but taken further: Snake Game ✓, Pong (next), and expanding. Not a challenge log — a structured engineering reference. Active
Wolframite Classification ML-driven mineral property prediction from quantum simulation data Manuscript in preparation
AI for Science Research Exploring AI applications in computational physics and materials science Active

💼 Experience

Technical Mentor — Moringa School (Jul 2024 – Present) Designed and delivered end-to-end AI/ML curriculum covering supervised learning, deep learning (CNNs, RNNs, Transformers), NLP, and model deployment to cohorts of 100+ students. Built original teaching frameworks, debugging methodologies, and code examples from first principles. 95% student retention rate.

Materials Science Research Intern — CMU Africa (May – Aug 2025) Computational research using Quantum ESPRESSO to simulate and analyse material properties. Applied statistical modelling to interpret simulation outputs. Contributing to wolframite classification research — manuscript in preparation.

Teaching Instructor — Young Engineers Kenya (Feb – Jul 2024) STEM instruction for learners aged 5–17 covering Python, HTML/CSS, Scratch, and robotics involving sensor integration and control algorithms.

Technical Consultant — Institute of Energy Studies and Research (Sep – Nov 2022) IT infrastructure support, network management, and system optimisation across research operations.


🎓 Education

Degree Institution Period
M.Sc. Engineering AI Carnegie Mellon University Africa, Kigali Aug 2024 – May 2026
BSc. Applied Physics & Computer Science Multimedia University of Kenya Sep 2018 – Dec 2022

📰 Research & Publications

  • Wolframite Mineral Classification Using Machine Learning — Computational materials science integrating ML with Quantum ESPRESSO simulations. Manuscript in preparation, expected 2026.
  • Multi-LLM Consensus Validation Architecture for Aviation Safety — Novel 5-model ensemble system with deterministic consensus engine for safety-critical classification. Research documentation on system design and validation methodology.

⚙️ Tech Stack

Languages

Python R SQL Bash

Machine Learning & AI

TensorFlow PyTorch Scikit-learn OpenCV Keras NLTK

Data Engineering

PostgreSQL MongoDB Cassandra Apache Spark MLflow FastAPI

Infrastructure

Docker Kubernetes Azure

Visualization & BI

Tableau PowerBI Matplotlib Jupyter


📊 GitHub Stats


🤝 Community

International Ministers Affairs Representative — CMU Africa (Dec 2024 – Present) Representing international students across institutional processes, providing emergency coordination and facilitating cross-cultural engagement.

Campus Director, Hult Prize Kenya — Multimedia University (Mar – Nov 2022) Led coordination of 20+ innovation teams competing on sustainable development challenges at the global stage.



If the work above interests you, I am worth a conversation.


 



Pinned Loading

  1. Computer-Vision-Explained Computer-Vision-Explained Public

    Every assignment for Computer Vision is clearly corrected and submitted here

    Jupyter Notebook 1

  2. Math-for-AI Math-for-AI Public

    All math concepts needed for AI covered

    Jupyter Notebook 1

  3. Principles-and-Engineering-Applications-of-AI Principles-and-Engineering-Applications-of-AI Public

    All the fundamental of AI from decision making to implimentation

    Jupyter Notebook 2 1

  4. Python_beginners_projects_codes Python_beginners_projects_codes Public

    Let's start your python journey together.

    Python

  5. LEE-NANGI/Swahili_Classification_Project LEE-NANGI/Swahili_Classification_Project Public

    Jupyter Notebook 1