Raihan raihan-js

Hey there, I'm Raihan 👋

AI/ML Engineer · CTO at ClarioScope AI · From Bangladesh

Train small language models (SLMs) from scratch · Fine-tune larger ones with QLoRA · Ship production AI products

What I'm Working On

🧠 Training small language models from scratch — the ORCH series (350M–3B) for Next.js code generation, and MedLLM-10M for medical applications
🔧 Fine-tuning larger base models with QLoRA — ORCH-7B is a 4-bit fine-tune of DeepSeek Coder 6.7B
🎯 Building benchmark-grade specialist SLMs — clarioscope-intent-deberta-v1 matches frontier LLMs within 4 pp of accuracy at 22× lower latency and $0/inference (dev.to writeup)
🏥 Leading engineering at ClarioScope AI — HIPAA-compliant healthcare practice growth platform
🚀 Shipping production AI products — BeautyCrew AI, VETR Proposal, CommonRoom AI
📚 Open-source everything — all model weights, configs, and tokenizers are public on Hugging Face

🏆 Latest ship: the full ClarioScope SLM Suite (all three models shipped)

A three-model intake intelligence pipeline for healthcare practices. Each model is small, specialized, and benchmarked head-to-head against frontier APIs. Suite-level writeup: Three small models for healthcare intake — and what shipping all three taught me.

Model	Task	Size	Headline result	Speed vs frontier	Cost / 1K	Links
clarioscope-intent-deberta-v1	7-class intent classification	184M	91.16% accuracy (within 4 pp of Claude Haiku)	22× faster	$0	🤗 · 📝
clarioscope-phi-deberta-v1	18-category HIPAA PHI span detection	125M	Macro F1 0.63 (triples frontier on `LOC`, ties on `NAME`/`DATE`/`PHONE`/`IP`/`AGE`)	45× faster	$0	🤗 · 📝
clarioscope-insurance-v1	12-field insurance / billing extraction	125M	Macro F1 0.79 (ties GPT-4o on `SUBSCRIBER_NAME`, within 5–13 pp on the four highest-volume fields)	26× faster	$0	🤗 · 📝

Total cost to build all three: ~$16 in OpenAI + RunPod + benchmark API spend. Total infrastructure: Hugging Face (free) + RunPod spot pods (a few cents per run).

The recurring pattern across all three: small specialized models don't replace frontier APIs — they're stage one of a hybrid pipeline that does the bulk-volume work cheaply, then defers a small fraction of hard cases to a frontier API. All three model cards include honest per-entity / per-class breakdowns showing where the small model wins and loses.

AI Models I've Trained

All published openly on 🤗 Hugging Face. Configs and tokenizers included.

🎼 ORCH Next.js 3B

A 3 billion parameter decoder-only transformer trained from scratch for full-stack Next.js code generation.

Spec	Value
Parameters	~3.0B
Architecture	Custom LLaMA-style
Layers / Hidden	32 / 2,560
Attention / KV (GQA)	32 / 8
Vocab	32,000 (custom)
Context	16,384 tokens
Hardware	NVIDIA A40 48GB (RunPod)

🔧 ORCH-7B

QLoRA fine-tune of DeepSeek Coder 6.7B Instruct, specialized for autonomous Next.js generation.

Spec	Value
Base model	DeepSeek Coder 6.7B Instruct
Method	QLoRA (4-bit NF4 + LoRA)
Training	43h on a single A100
Steps	5,238
Context	16,384 (linear RoPE 4×)

🚀 ORCH Fusion (272M)

Compact code-gen model trained from scratch on consumer hardware (RTX 3060 12GB).

Spec	Value
Parameters	272.7M
Architecture	Custom LLaMA-style
Layers / Hidden	24 / 1,024
GQA	16 heads / 4 KV
Vocab	2,103 (tiny custom)

Benchmark (ORCH-ProjectBench): 76.6 overall · 95.3 code parse · 93.3 format

🩺 MedLLM-10M

GPT-2 style language model trained from scratch on medical literature.

Spec	Value
Parameters	~27.7M (10M body)
Architecture	GPT-2
Layers / Hidden	8 / 512
Heads / FFN	8 / 2,048
Vocab	5,000 (custom)
Context	512
Hardware	RTX 3060 12GB

⚠️ Research / educational use only. Not for clinical decision-making.

Also published: ORCH Next.js 350M v2 (287M, from scratch with 16k vocab).

Tech Stack

AI & ML

Frontend

Backend

Cloud & DevOps

Selected Live Products

Product	What it does
ClarioScope AI	HIPAA-compliant healthcare practice growth platform (CTO)
BeautyCrew AI	Booking management for the beauty industry — prevents missed appointments
VETR Proposal	AI-assisted federal contracting co-pilot for small business teams
CommonRoom AI	Collaborative digital workspace — 15 group-coordination tools, no install
ORCH Studio	Generate complete Next.js apps from natural language (powered by ORCH-7B)

Open Source

orch-ai — Hugging Face org for the ORCH code-generation model family
clarioscope-ai — ClarioScope AI's Hugging Face org
Configs, tokenizers, and training details are public on every model card

📫 Get in touch: raihan@clarioscope.ai · Portfolio

Provide feedback

Saved searches

Use saved searches to filter your results more quickly