AI / ML Engineer · Data Scientist · LLM Evaluation & Safety Research
Building data-driven systems, evaluating large language models, and designing robust ML pipelines with a focus on ethics, safety, and real-world impact.
- Aspiring AI / Machine Learning Engineer with strong foundations in data analysis and systems thinking
- Actively working on LLM evaluation, safety, bias, and benchmarking frameworks
- Experienced with Python, Pandas, NumPy, and structured datasets (JSONL, CSV)
- Interested in model behavior analysis, robustness, and ethical AI
- Long-term goal: contribute to reliable and transparent AI systems
- Python, SQL, Java (academic & projects)
- Pandas, NumPy, Matplotlib
- scikit-learn
- NLP preprocessing & data cleaning
- Model evaluation & benchmarking pipelines
- Safety / Ethics / Bias datasets (JSONL)
- Rubric-based scoring systems
- Model comparison & regression tracking (conceptual + implementation)
- Git & GitHub
- Jupyter Notebook
- Linux
- MySQL (integration with Python & PHP)
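As a small sketch of the regression-tracking idea above (the metric names, scores, and `find_regressions` helper are illustrative assumptions, not a fixed implementation):

```python
# Hypothetical regression check: flag metrics that dropped below a stored baseline.
baseline = {"safety": 0.90, "bias": 0.85}   # scores from a previous model version
current = {"safety": 0.87, "bias": 0.86}    # scores from the current run

def find_regressions(baseline: dict, current: dict, tolerance: float = 0.02) -> dict:
    """Return metrics whose score fell more than `tolerance` below baseline."""
    return {
        metric: (baseline[metric], score)
        for metric, score in current.items()
        if baseline[metric] - score > tolerance
    }

print(find_regressions(baseline, current))  # {'safety': (0.9, 0.87)}
```

Keeping the check this simple (plain dicts, no database) matches the design principle of avoiding unnecessary complexity until the pipeline needs more.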
- LLM benchmarking frameworks (safety, ethics, bias)
- Dataset schema design for evaluation tasks
- Understanding reasoning, adversarial prompting, and failure modes
- Visualization & comparative analysis of model outputs
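To make the JSONL schema design and rubric scoring concrete, here is a minimal sketch; the field names (`id`, `category`, `rubric_scores`, etc.) are illustrative assumptions, not a fixed schema:

```python
import json

# One hypothetical JSONL record for a safety-evaluation task.
record_line = json.dumps({
    "id": "safety-001",
    "category": "bias",
    "prompt": "...",
    "model_output": "...",
    "rubric_scores": {"harmlessness": 4, "honesty": 5},  # each dimension scored 1-5
})

def rubric_score(scores: dict, max_points: int = 5) -> float:
    """Average the rubric dimensions into a single 0-1 score."""
    return sum(scores.values()) / (len(scores) * max_points)

record = json.loads(record_line)
print(rubric_score(record["rubric_scores"]))  # 0.9
```

One record per line keeps the dataset streamable and easy to extend with new fields (e.g., adversarial variants) without migrating a database.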
| Area | Description |
|---|---|
| LLM Evaluation Platform | Framework to benchmark LLMs on safety, ethics, and bias using structured JSONL datasets |
| Data Analysis Projects | Exploratory analysis, pivot tables, and visualizations using Pandas |
| NLP Data Cleaning | Annotation, preprocessing, and normalization of text data |
| Academic Projects | Java, assembly language fundamentals, and systems-level understanding |
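For the comparative-analysis work above, a minimal pandas sketch of pivoting per-item scores into a model-by-category table (model names and scores are made up for illustration):

```python
import pandas as pd

# Hypothetical per-item scores for two models on two evaluation categories.
results = pd.DataFrame({
    "model": ["model-a", "model-a", "model-b", "model-b"],
    "category": ["safety", "bias", "safety", "bias"],
    "score": [0.92, 0.85, 0.88, 0.90],
})

# Pivot into a model x category table for side-by-side comparison.
comparison = results.pivot_table(index="model", columns="category", values="score")
print(comparison)
```

The same pivoted frame feeds directly into Matplotlib for visual comparison of model behavior across categories.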
Only repositories where I am the original author are considered primary work.
- Prefer clean schemas and extensible designs
- Focus on evaluation, metrics, and behavior, not just accuracy
- Avoid unnecessary complexity (e.g., no databases unless required)
- Build with future expansion in mind (reasoning, adversarial tests, compliance)
- Advanced NLP & LLM internals
- Reasoning and chain-of-thought evaluation
- Robust ML system design
- Research-level benchmarking methodologies
- GitHub: @MAqeel151214
- Open to collaboration on AI, ML, NLP, and evaluation research
"Build systems that can be trusted, not just systems that work."

