Skip to content
View raihan-js's full-sized avatar
🔥
Working from home
🔥
Working from home

Block or report raihan-js

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
raihan-js/README.md

Hey there, I'm Raihan 👋

AI/ML Engineer · CTO at ClarioScope AI · From Bangladesh

Train small language models (SLMs) from scratch · Fine-tune larger ones with QLoRA · Ship production AI products

Portfolio Hugging Face LinkedIn Email


What I'm Working On

  • 🧠 Training small language models from scratch — the ORCH series (350M–3B) for Next.js code generation, and MedLLM-10M for medical applications
  • 🔧 Fine-tuning larger base models with QLoRA — ORCH-7B is a 4-bit fine-tune of DeepSeek Coder 6.7B
  • 🎯 Building benchmark-grade specialist SLMsclarioscope-intent-deberta-v1 matches frontier LLMs within 4 pp of accuracy at 22× lower latency and $0/inference (dev.to writeup)
  • 🏥 Leading engineering at ClarioScope AI — HIPAA-compliant healthcare practice growth platform
  • 🚀 Shipping production AI productsBeautyCrew AI, VETR Proposal, CommonRoom AI
  • 📚 Open-source everything — all model weights, configs, and tokenizers are public on Hugging Face

🏆 Latest ship: the full ClarioScope SLM Suite (all three models shipped)

A three-model intake intelligence pipeline for healthcare practices. Each model is small, specialized, and benchmarked head-to-head against frontier APIs. Suite-level writeup: Three small models for healthcare intake — and what shipping all three taught me.

Model Task Size Headline result Speed vs frontier Cost / 1K Links
clarioscope-intent-deberta-v1 7-class intent classification 184M 91.16% accuracy (within 4 pp of Claude Haiku) 22× faster $0 🤗 · 📝
clarioscope-phi-deberta-v1 18-category HIPAA PHI span detection 125M Macro F1 0.63 (triples frontier on LOC, ties on NAME/DATE/PHONE/IP/AGE) 45× faster $0 🤗 · 📝
clarioscope-insurance-v1 12-field insurance / billing extraction 125M Macro F1 0.79 (ties GPT-4o on SUBSCRIBER_NAME, within 5–13 pp on the four highest-volume fields) 26× faster $0 🤗 · 📝

Total cost to build all three: ~$16 in OpenAI + RunPod + benchmark API spend. Total infrastructure: Hugging Face (free) + RunPod spot pods (a few cents per run).

The recurring pattern across all three: small specialized models don't replace frontier APIs — they're stage one of a hybrid pipeline that does the bulk-volume work cheaply, then defers a small fraction of hard cases to a frontier API. All three model cards include honest per-entity / per-class breakdowns showing where the small model wins and loses.

Per-entity F1 — PHI detector vs frontier APIs


AI Models I've Trained

All published openly on 🤗 Hugging Face. Configs and tokenizers included.

🎼 ORCH Next.js 3B

A 3 billion parameter decoder-only transformer trained from scratch for full-stack Next.js code generation.

Spec Value
Parameters ~3.0B
Architecture Custom LLaMA-style
Layers / Hidden 32 / 2,560
Attention / KV (GQA) 32 / 8
Vocab 32,000 (custom)
Context 16,384 tokens
Hardware NVIDIA A40 48GB (RunPod)

Model

🔧 ORCH-7B

QLoRA fine-tune of DeepSeek Coder 6.7B Instruct, specialized for autonomous Next.js generation.

Spec Value
Base model DeepSeek Coder 6.7B Instruct
Method QLoRA (4-bit NF4 + LoRA)
Training 43h on a single A100
Steps 5,238
Context 16,384 (linear RoPE 4×)

Model Studio

🚀 ORCH Fusion (272M)

Compact code-gen model trained from scratch on consumer hardware (RTX 3060 12GB).

Spec Value
Parameters 272.7M
Architecture Custom LLaMA-style
Layers / Hidden 24 / 1,024
GQA 16 heads / 4 KV
Vocab 2,103 (tiny custom)

Benchmark (ORCH-ProjectBench): 76.6 overall · 95.3 code parse · 93.3 format

Model

🩺 MedLLM-10M

GPT-2 style language model trained from scratch on medical literature.

Spec Value
Parameters ~27.7M (10M body)
Architecture GPT-2
Layers / Hidden 8 / 512
Heads / FFN 8 / 2,048
Vocab 5,000 (custom)
Context 512
Hardware RTX 3060 12GB

⚠️ Research / educational use only. Not for clinical decision-making.

Model

Also published: ORCH Next.js 350M v2 (287M, from scratch with 16k vocab).


Tech Stack

AI & ML

PyTorch Hugging Face Transformers QLoRA CUDA Gradio

Frontend

React Next.js TypeScript Tailwind

Backend

Python Laravel Node.js FastAPI

Cloud & DevOps

AWS Docker RunPod Linux


Selected Live Products

Product What it does
ClarioScope AI HIPAA-compliant healthcare practice growth platform (CTO)
BeautyCrew AI Booking management for the beauty industry — prevents missed appointments
VETR Proposal AI-assisted federal contracting co-pilot for small business teams
CommonRoom AI Collaborative digital workspace — 15 group-coordination tools, no install
ORCH Studio Generate complete Next.js apps from natural language (powered by ORCH-7B)

Open Source

  • orch-ai — Hugging Face org for the ORCH code-generation model family
  • clarioscope-ai — ClarioScope AI's Hugging Face org
  • Configs, tokenizers, and training details are public on every model card

📫 Get in touch: raihan@clarioscope.ai · Portfolio

Pinned Loading

  1. 33-js-concepts 33-js-concepts Public

    Forked from leonardomso/33-js-concepts

    📜 33 JavaScript concepts every developer should know.

    JavaScript

  2. GPT-NEO-1.3B GPT-NEO-1.3B Public

    Python

  3. docs docs Public

    Forked from laravel/docs

    The Laravel documentation.

  4. yolov5-onnx-acnedataset yolov5-onnx-acnedataset Public

    HTML

  5. LLPhant/LLPhant LLPhant/LLPhant Public

    LLPhant - A comprehensive PHP Generative AI Framework using OpenAI GPT 4. Inspired by Langchain

    PHP 1.7k 165