Skip to content

raihan-js/raihan-js

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

45 Commits
Β 
Β 

Repository files navigation

Hey there, I'm Raihan πŸ‘‹

AI/ML Engineer Β· CTO at ClarioScope AI Β· From Bangladesh

Train small language models (SLMs) from scratch Β· Fine-tune larger ones with QLoRA Β· Ship production AI products

Portfolio Hugging Face LinkedIn Email


What I'm Working On

  • 🧠 Training small language models from scratch β€” the ORCH series (350M–3B) for Next.js code generation, and MedLLM-10M for medical applications
  • πŸ”§ Fine-tuning larger base models with QLoRA β€” ORCH-7B is a 4-bit fine-tune of DeepSeek Coder 6.7B
  • 🎯 Building benchmark-grade specialist SLMs β€” clarioscope-intent-deberta-v1 matches frontier LLMs within 4 pp of accuracy at 22Γ— lower latency and $0/inference (dev.to writeup)
  • πŸ₯ Leading engineering at ClarioScope AI β€” HIPAA-compliant healthcare practice growth platform
  • πŸš€ Shipping production AI products β€” BeautyCrew AI, VETR Proposal, CommonRoom AI
  • πŸ“š Open-source everything β€” all model weights, configs, and tokenizers are public on Hugging Face

πŸ† Latest ship: the full ClarioScope SLM Suite (all three models shipped)

A three-model intake intelligence pipeline for healthcare practices. Each model is small, specialized, and benchmarked head-to-head against frontier APIs. Suite-level writeup: Three small models for healthcare intake β€” and what shipping all three taught me.

Model Task Size Headline result Speed vs frontier Cost / 1K Links
clarioscope-intent-deberta-v1 7-class intent classification 184M 91.16% accuracy (within 4 pp of Claude Haiku) 22Γ— faster $0 πŸ€— Β· πŸ“
clarioscope-phi-deberta-v1 18-category HIPAA PHI span detection 125M Macro F1 0.63 (triples frontier on LOC, ties on NAME/DATE/PHONE/IP/AGE) 45Γ— faster $0 πŸ€— Β· πŸ“
clarioscope-insurance-v1 12-field insurance / billing extraction 125M Macro F1 0.79 (ties GPT-4o on SUBSCRIBER_NAME, within 5–13 pp on the four highest-volume fields) 26Γ— faster $0 πŸ€— Β· πŸ“

Total cost to build all three: ~$16 in OpenAI + RunPod + benchmark API spend. Total infrastructure: Hugging Face (free) + RunPod spot pods (a few cents per run).

The recurring pattern across all three: small specialized models don't replace frontier APIs β€” they're stage one of a hybrid pipeline that does the bulk-volume work cheaply, then defers a small fraction of hard cases to a frontier API. All three model cards include honest per-entity / per-class breakdowns showing where the small model wins and loses.

Per-entity F1 β€” PHI detector vs frontier APIs


AI Models I've Trained

All published openly on πŸ€— Hugging Face. Configs and tokenizers included.

🎼 ORCH Next.js 3B

A 3 billion parameter decoder-only transformer trained from scratch for full-stack Next.js code generation.

Spec Value
Parameters ~3.0B
Architecture Custom LLaMA-style
Layers / Hidden 32 / 2,560
Attention / KV (GQA) 32 / 8
Vocab 32,000 (custom)
Context 16,384 tokens
Hardware NVIDIA A40 48GB (RunPod)

Model

πŸ”§ ORCH-7B

QLoRA fine-tune of DeepSeek Coder 6.7B Instruct, specialized for autonomous Next.js generation.

Spec Value
Base model DeepSeek Coder 6.7B Instruct
Method QLoRA (4-bit NF4 + LoRA)
Training 43h on a single A100
Steps 5,238
Context 16,384 (linear RoPE 4Γ—)

Model Studio

πŸš€ ORCH Fusion (272M)

Compact code-gen model trained from scratch on consumer hardware (RTX 3060 12GB).

Spec Value
Parameters 272.7M
Architecture Custom LLaMA-style
Layers / Hidden 24 / 1,024
GQA 16 heads / 4 KV
Vocab 2,103 (tiny custom)

Benchmark (ORCH-ProjectBench): 76.6 overall Β· 95.3 code parse Β· 93.3 format

Model

🩺 MedLLM-10M

GPT-2 style language model trained from scratch on medical literature.

Spec Value
Parameters ~27.7M (10M body)
Architecture GPT-2
Layers / Hidden 8 / 512
Heads / FFN 8 / 2,048
Vocab 5,000 (custom)
Context 512
Hardware RTX 3060 12GB

⚠️ Research / educational use only. Not for clinical decision-making.

Model

Also published: ORCH Next.js 350M v2 (287M, from scratch with 16k vocab).


Tech Stack

AI & ML

PyTorch Hugging Face Transformers QLoRA CUDA Gradio

Frontend

React Next.js TypeScript Tailwind

Backend

Python Laravel Node.js FastAPI

Cloud & DevOps

AWS Docker RunPod Linux


Selected Live Products

Product What it does
ClarioScope AI HIPAA-compliant healthcare practice growth platform (CTO)
BeautyCrew AI Booking management for the beauty industry β€” prevents missed appointments
VETR Proposal AI-assisted federal contracting co-pilot for small business teams
CommonRoom AI Collaborative digital workspace β€” 15 group-coordination tools, no install
ORCH Studio Generate complete Next.js apps from natural language (powered by ORCH-7B)

Open Source

  • orch-ai β€” Hugging Face org for the ORCH code-generation model family
  • clarioscope-ai β€” ClarioScope AI's Hugging Face org
  • Configs, tokenizers, and training details are public on every model card

πŸ“« Get in touch: raihan@clarioscope.ai Β· Portfolio

About

Config files for my GitHub profile.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors