Skip to content
View SepehrRezaee's full-sized avatar
🌍
Hello World!!
🌍
Hello World!!

Highlights

  • Pro

Block or report SepehrRezaee

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
SepehrRezaee/README.md

Hi, I'm Sepehr Rezaee 👋

Senior AI Architect / Senior LLM Engineer building production-grade agentic LLM systems, RAG platforms, AI market-intelligence pipelines, and multi-agent automation systems.

I focus on transforming real-world business and data problems into reliable AI products: routing, retrieval, context assembly, structured outputs, eval loops, observability, and production deployment. My work sits at the intersection of LLM platform engineering, multi-agent orchestration, AI safety, and market/decision intelligence systems.

  • 🧠 5+ years building AI, ML, and production LLM systems
  • 🏢 Senior LLM Engineer @ AIR Property, Dubai
  • 🤖 Former AI Engineer, Agentic Systems @ PropTy Global
  • 🏗️ Former Chief AI Officer & Multi-Agent Architect @ Novel Mind Scientist
  • 🔬 Research Intern, Safe & Generative AI @ Mathis Lab, EPFL
  • 🎓 B.Sc. in Computer Science, Shahid Beheshti University

📄 See my full CV


What I Build

I design AI systems that are not just chatbots, but controlled intelligence pipelines with clear routing, evidence handling, structured outputs, and measurable quality.

  • Agentic AI Systems
    Multi-agent architectures with task routing, memory, tool orchestration, handoff logic, fallback strategies, and production guardrails.

  • RAG & Retrieval Platforms
    Retrieval pipelines with vector databases, hybrid search, reranking, context assembly, caching, evaluation, and governance.

  • AI Market Intelligence Systems
    Pipelines for collecting, structuring, and reasoning over external data sources such as market data, financial signals, crypto/on-chain data, news, and domain-specific knowledge.

  • LLM Evaluation & Observability
    Evals, tracing, prompt/context/output logging, quality metrics, regression tests, dashboards, and feedback loops tied to product KPIs.

  • AI Safety & Security
    Red-team evaluation, prompt/tool governance, fallback policies, robust evaluation, and research-to-production security practices.


Current Focus

  • Production RAG + agent services
  • Routing-first LLM architectures instead of one universal agent
  • Structured-output pipelines for reliable automation
  • AI observability with tracing, logging, evals, and debugging workflows
  • Market-intelligence and decision-support systems powered by LLMs
  • Research-to-production transfer in AI safety and model security

Selected Impact

  • Productized reusable multi-agent RAG platforms with standardized prompts, tools, memory, evaluations, and guardrails
  • Built production agentic workflows for autonomous recommendations and business decision support
  • Achieved 85%+ end-to-end task completion in real-world business workflows
  • Designed SLO-driven AI services with tracing, dashboards, fallback trees, and safety metrics
  • Improved reliability under peak load using vector caching, retrieval tuning, and inference optimization
  • Built eval/feedback loops connected to live KPIs for continuous improvement and drift monitoring
  • Co-authored AI safety research accepted at ICCV 2025 and NeurIPS 2024

Experience

Senior LLM Engineer — AIR Property, Dubai

Aug 2025 – Apr 2026

  • Architected and shipped production LLM services across RAG, agents, and decision-support workflows
  • Owned model selection, prompt/agent design, retrieval strategy, evaluation harnesses, and fallback trees
  • Built governed AI toolchains with tool registries, policies, approvals, and reusable architecture patterns
  • Implemented reliability workflows with SLOs, error budgets, monitoring, tracing, and safety metrics
  • Optimized retrieval and inference through data curation, vector caching, routing improvements, and capacity planning

AI Engineer, Agentic Systems — PropTy Global, Dubai

Aug 2024 – Sep 2025

  • Architected multi-agent systems using LangChain, custom orchestration, and RAG pipelines
  • Built context-aware planning with agent-to-agent protocols, memory, dynamic routing, and escalation paths
  • Productionized AI services with Docker, Kubernetes, Prometheus/Grafana, and centralized logging
  • Closed the loop with automated evaluation and feedback tied to live business KPIs

Chief AI Officer & Multi-Agent Architect — Novel Mind Scientist, Tehran

Oct 2022 – Sep 2025

  • Led delivery of LLM-powered agents across SaaS, healthcare, and education domains
  • Designed multi-agent automation pipelines using LangChain, Celery, APIs, and knowledge systems
  • Established engineering standards including design docs, ADRs, evaluation protocols, onboarding guides, and review processes
  • Integrated text, vision, and knowledge-graph components into reliable AI services

Research Intern, Safe & Generative AI — Mathis Lab, EPFL

May 2025 – Sep 2025

  • Co-authored ICCV 2025 accepted work on data-free diffusion-based trigger inversion for Trojaned models
  • Built latent-diffusion pipelines with classifier-guided feedback for exposing adversarial vulnerabilities
  • Developed zero-shot and data-free defense methods with large-scale benchmarking and robust evaluation practices

Research Assistant — Agentic AI & Security

Sharif University & Shahid Beheshti University | 2023 – 2025

  • Prototyped secure agentic ML pipelines with RAG, routing, memory management, and evaluation loops
  • Worked on AI model security, agent evaluation, and optimization research
  • Contributed to publication-driven research and mentored junior engineers

Project Manager, Agentic ML SaaS — NovaVira, Tehran

Mar 2023 – Feb 2024

  • Delivered a Django-based agentic recommender system with hybrid search and automated workflows
  • Led Agile delivery, CI/CD, and platform iteration for production AI features

Core Skills

AI / LLM Systems

  • Agentic AI
  • Multi-Agent Systems
  • RAG
  • Routing & Context Assembly
  • Structured Outputs
  • Prompt Engineering
  • LLM Evaluation
  • Safety Guardrails
  • Memory Systems
  • Tool Orchestration
  • LangChain
  • LangGraph
  • LlamaIndex

Evaluation / Observability

  • Eval Harnesses
  • Prompt/Context/Output Logging
  • Tracing
  • Regression Testing
  • SLOs & Error Budgets
  • Drift Monitoring
  • Prometheus
  • Grafana
  • ELK
  • Langfuse-style Observability

Backend / Infrastructure

  • Python
  • FastAPI
  • Flask
  • REST / GraphQL
  • Docker
  • Kubernetes
  • Celery
  • Redis
  • MLflow
  • Airflow
  • CI/CD

Data / Storage

  • PostgreSQL
  • MongoDB
  • Redis
  • Pinecone
  • Weaviate
  • Chroma
  • pgvector
  • Neo4j

Cloud / Platforms

  • AWS
  • GCP
  • Azure

Programming Languages

  • Python
  • C++
  • Java
  • C#

Architecture Highlights

  • Routing-First Agent Systems
    Designing workflows where user intent is routed to specialized pipelines instead of relying on one generic agent for every task.

  • Multi-Agent RAG Platforms
    Standardized prompts, tools, memory, retrieval, evals, fallback policies, and guardrails for reusable AI services.

  • Context Assembly & Structured Outputs
    Building controlled pipelines that retrieve, rank, compress, format, and validate context before producing machine-readable outputs.

  • Evaluation & Optimization
    Prompt A/B testing, routing comparisons, retrieval-quality checks, automated evals, and KPI-connected feedback loops.

  • Safety & Governance
    Prompt/tool policies, red-team evaluations, fallback strategies, incident-response-ready production patterns, and robust AI security practices.


Selected Projects

Multi-Agent RAG Platform

A reusable platform for building agentic AI services with routing, retrieval, memory, tool orchestration, evaluation, and guardrails.

Core ideas: LangChain/LangGraph-style orchestration, vector search, prompt governance, fallback trees, eval harnesses, and observability.

AI Market Intelligence Pipeline

A decision-intelligence architecture for collecting and analyzing market, news, crypto, and domain-specific signals through specialized LLM pipelines.

Core ideas: external data ingestion, retrieval, scenario-specific pipelines, structured outputs, signal summarization, tracing, and evaluation.

AI Model Security Research

Research and engineering work on Trojan/backdoor detection, robust evaluation, and data-free defenses for safer AI systems.

Core ideas: diffusion-based inversion, adversarial scanning, benchmark evaluation, zero-shot defenses, and production risk translation.


Publications

  • DISTIL: Data-Free Inversion of Suspicious Trojan Inputs via Latent Diffusion — ICCV 2025 (accepted)
  • Scanning Trojaned Models Using Out-of-Distribution Samples — NeurIPS 2024 (accepted)
  • Comparison of Pre-Training and Classification Models for Early Detection of Alzheimer’s Disease Using MRI — I4C 2023

Awards

  • Best Ideator — National Young Scientists Festival (2023)
  • Top 0.2% National Entrance Exam — Rank 352 / 150,000 (2020)

Education

B.Sc. in Computer Science
Shahid Beheshti University, Tehran
2021 – 2025
GPA: 3.4 / 4.0


Languages

  • Persian — Native
  • English — Professional

Connect


Profile views

GitHub stats

Top Langs

Pinned Loading

  1. omani_therapist omani_therapist Public

    Python 2

  2. An-implementation-of-Semantic-IMDB-Regression-using-TensorFlow An-implementation-of-Semantic-IMDB-Regression-using-TensorFlow Public

    Jupyter Notebook 3

  3. ImageProcessing_CourseOf_ShahidBeheshtiUniversity ImageProcessing_CourseOf_ShahidBeheshtiUniversity Public

    This repository is about the assignments of Image Processing course of Shahid Beheshti University with its implementations

    Jupyter Notebook 2 1

  4. deep-learning-symbolic-mathematics deep-learning-symbolic-mathematics Public

    Forked from elia-mercatanti/deep-learning-symbolic-mathematics

    Discussion and test of the first successful approach to solving symbolic mathematics problems through the use of neural networks, proposed for the first time by two Facebook researchers, Guillaume …

    Jupyter Notebook 3

  5. neural-netwroks-from-scratch neural-netwroks-from-scratch Public

    Forked from alirezaafzalaghaei/neural-networks-from-scratch

    implementing some neural networks in python

    Jupyter Notebook 3

  6. NN-Fall-2022 NN-Fall-2022 Public

    This repository is about the neural netowrk course that helding in Shahid Beheshti University

    Jupyter Notebook 2