PhiCookBook is a comprehensive cookbook repository containing hands-on examples, tutorials, and documentation for working with Microsoft's Phi family of Small Language Models (SLMs). The repository demonstrates various use cases including inference, fine-tuning, quantization, RAG implementations, and multimodal applications across different platforms and frameworks.
Key Technologies:
- Languages: Python, C#/.NET, JavaScript/Node.js
- Frameworks: ONNX Runtime, PyTorch, Transformers, MLX, OpenVINO, Semantic Kernel
- Platforms: Microsoft Foundry, GitHub Models, Hugging Face, Ollama
- Model Types: Phi-3, Phi-3.5, Phi-4 (text, vision, multimodal, reasoning variants)
Repository Structure:
- `/code/` - Working code examples and sample implementations
- `/md/` - Detailed documentation, tutorials, and how-to guides
- `/translations/` - Multi-language translations (50+ languages via automated workflow)
- `/.devcontainer/` - Dev container configuration (Python 3.12 with Ollama)
- Open in GitHub Codespaces (fastest):
  - Click the "Open in GitHub Codespaces" badge in the README
  - Container auto-configures with Python 3.12 and Ollama with Phi-3
- Open in VS Code Dev Containers:
  - Use the "Open in Dev Containers" badge from the README
  - Container requires 16GB host memory minimum
Prerequisites:
- Python 3.12 or later
- .NET 8.0 SDK (for C# examples)
- Node.js 18+ and npm (for JavaScript examples)
- 16GB RAM minimum recommended
Installation:

```bash
git clone https://github.com/microsoft/PhiCookBook.git
cd PhiCookBook
```

For Python Examples: Navigate to specific example directories and install dependencies:

```bash
cd code/<example-directory>
pip install -r requirements.txt  # if requirements.txt exists
```

For .NET Examples:

```bash
cd md/04.HOL/dotnet/src
dotnet restore LabsPhi.sln
dotnet build LabsPhi.sln
```

For JavaScript/Web Examples:

```bash
cd code/08.RAG/rag_webgpu_chat
npm install
npm run dev    # Start development server
npm run build  # Build for production
```

In /code/:
- 01.Introduce/ - Basic introductions and getting started samples
- 03.Finetuning/ and 04.Finetuning/ - Fine-tuning examples with various methods
- 03.Inference/ - Inference examples on different hardware (AIPC, MLX)
- 06.E2E/ - End-to-end application samples
- 07.Lab/ - Laboratory/experimental implementations
- 08.RAG/ - Retrieval-Augmented Generation samples
- 09.UpdateSamples/ - Latest updated samples
In /md/:
- 01.Introduction/ - Intro guides, environment setup, platform guides
- 02.Application/ - Application samples organized by type (Text, Code, Vision, Audio, etc.)
- 02.QuickStart/ - Quick start guides for Microsoft Foundry and GitHub Models
- 03.FineTuning/ - Fine-tuning documentation and tutorials
- 04.HOL/ - Hands-on labs (includes .NET examples)
- Jupyter Notebooks (`.ipynb`) - Interactive Python tutorials, marked with 📓 in the README
- Python Scripts (`.py`) - Standalone Python examples
- C# Projects (`.csproj`, `.sln`) - .NET applications and samples
- JavaScript (`.js`, `package.json`) - Web-based and Node.js examples
- Markdown (`.md`) - Documentation and guides
Most examples are provided as Jupyter notebooks:

```bash
pip install jupyter notebook
jupyter notebook  # Opens browser interface
# Navigate to the desired .ipynb file
```

For Python scripts:

```bash
cd code/<example-directory>
pip install -r requirements.txt
python <script-name>.py
```

For .NET projects:

```bash
cd md/04.HOL/dotnet/src/<project-name>
dotnet run
```

Or run a specific project from the solution directory:

```bash
cd md/04.HOL/dotnet/src
dotnet run --project <project-name>
```

For JavaScript/Web examples:

```bash
cd code/08.RAG/rag_webgpu_chat
npm install
npm run dev  # Development with hot reload
```

This repository contains example code and tutorials rather than a traditional software project with unit tests. Validation is typically done by:
- Running the examples - Each example should execute without errors
- Verifying outputs - Check that model responses are appropriate
- Following tutorials - Step-through guides should work as documented
Common validation approach:
- Test example execution in the target environment
- Verify dependencies install correctly
- Check that model downloads/loads successfully
- Confirm expected behavior matches documentation
- Examples should be clear, well-commented, and educational
- Follow language-specific conventions (PEP 8 for Python, C# standards for .NET)
- Keep examples focused on demonstrating specific Phi model capabilities
- Include comments explaining key concepts and model-specific parameters
URL Formatting:
- Use `[text](url)` format without extra spaces
- Relative links: use `./` for the current directory, `../` for the parent
- No country-specific locales in URLs (avoid `/en-us/`, `/en/`)
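As an illustration of the locale rule, here is a naive helper (my own sketch, not part of the repository's tooling) that strips a leading locale segment from a URL:

```python
import re

def strip_locale(url: str) -> str:
    """Remove a leading locale path segment such as /en-us/ or /en/.

    Illustrative only: the pattern is naive and can also match other
    two-letter path segments, so review results before committing.
    """
    return re.sub(r"(https?://[^/]+)/[a-z]{2}(?:-[a-z]{2})?/", r"\1/", url, count=1)

# strip_locale("https://learn.microsoft.com/en-us/azure/ai-services/")
#   -> "https://learn.microsoft.com/azure/ai-services/"
```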
Images:
- Store all images in the `/imgs/` directory
- Use descriptive names with English characters, numbers, and dashes
- Example: `phi-3-architecture.png`
Markdown Files:
- Reference actual working examples in the `/code/` directory
- Keep documentation synchronized with code changes
- Use the 📓 emoji to mark Jupyter notebook links in the README

File Organization:
- Code examples in `/code/` are organized by topic/feature
- Documentation in `/md/` mirrors the code structure when applicable
- Keep related files (notebooks, scripts, configs) together in subdirectories
Contribution workflow:
- Fork the repository to your account
- Separate PRs by type:
  - Bug fixes in one PR
  - Documentation updates in another
  - New examples in separate PRs
  - Typo fixes can be combined
- Handle merge conflicts:
  - Update your local `main` branch before making changes
  - Sync with upstream frequently
- Translation PRs:
  - Must include translations for ALL files in the folder
  - Maintain consistent structure with the original language
PRs automatically run GitHub workflows to validate:
- Relative path validation - all internal links must work
  - Test links locally: Ctrl+Click in VS Code
  - Use path suggestions from VS Code (`./` or `../`)
- URL locale check - web URLs must not contain country locales
  - Remove `/en-us/`, `/en/`, or other language codes
  - Use generic international URLs
- Broken URL check - all URLs must return a 200 status
  - Verify links are accessible before submitting
  - Note: some failures may be due to network restrictions
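A rough local pre-check for the relative-path validation can be sketched in a few lines of Python. The function name and regex are illustrative and are not the actual GitHub workflow:

```python
import re
from pathlib import Path

# Matches [text](target); targets with spaces are intentionally excluded.
LINK_RE = re.compile(r"\[[^\]]*\]\(([^)\s]+)\)")

def broken_relative_links(md_file):
    """Return relative link targets in a Markdown file that don't resolve on disk."""
    base = Path(md_file).parent
    broken = []
    for target in LINK_RE.findall(Path(md_file).read_text(encoding="utf-8")):
        if target.startswith(("http://", "https://", "#", "mailto:")):
            continue  # only relative paths are checked here
        if not (base / target.split("#")[0]).exists():
            broken.append(target)
    return broken
```

Run it over changed `.md` files before opening a PR to catch obvious breakage early.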
Commit message format:

```
[component] Brief description
```

Examples:

```
[docs] Add Phi-4 inference tutorial
[code] Fix ONNX Runtime integration example
[translation] Add Japanese translation for intro guides
```
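A hypothetical validator for the convention above; the component names are only the ones shown in the examples, so extend the pattern if other components are in use:

```python
import re

# Mirrors the "[component] Brief description" convention; not part of the
# repository's actual CI checks.
COMMIT_RE = re.compile(r"^\[(docs|code|translation)\] \S.*$")

def is_valid_commit_message(msg: str) -> bool:
    """True if msg matches the [component] Brief description convention."""
    return bool(COMMIT_RE.match(msg))
```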
Model Loading:
- Examples use various frameworks: Transformers, ONNX Runtime, MLX, OpenVINO
- Models are typically downloaded from Hugging Face, Azure, or GitHub Models
- Check model compatibility with your hardware (CPU, GPU, NPU)
Inference Patterns:
- Text generation: Most examples use chat/instruct variants
- Vision: Phi-3-vision and Phi-4-multimodal for image understanding
- Audio: Phi-4-multimodal supports audio inputs
- Reasoning: Phi-4-reasoning variants for advanced reasoning tasks
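The chat/instruct pattern above can be sketched with Hugging Face Transformers. This is a hedged example, not the repository's canonical code: `build_messages`, `generate`, and the `max_new_tokens` value are illustrative, the model id is one published Phi-3 variant, and `transformers` plus a PyTorch backend must be installed:

```python
def build_messages(user_prompt):
    # Chat/instruct variants expect messages as role/content dicts.
    return [
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": user_prompt},
    ]

def generate(prompt, model_id="microsoft/Phi-3-mini-4k-instruct"):
    """Run one chat-style generation; downloads the model on first call."""
    from transformers import pipeline  # pip install transformers torch
    pipe = pipeline("text-generation", model=model_id)
    out = pipe(build_messages(prompt), max_new_tokens=64)
    # With chat input, generated_text holds the full message list, ending
    # with the assistant reply (behavior of recent transformers versions).
    return out[0]["generated_text"][-1]["content"]
```

Expect the first call to download several gigabytes; use a quantized or ONNX variant on constrained hardware.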
Microsoft Foundry:
- Requires Azure subscription and API keys
- See `/md/02.QuickStart/AzureAIFoundry_QuickStart.md`
GitHub Models:
- Free tier available for testing
- See `/md/02.QuickStart/GitHubModel_QuickStart.md`
Local Inference:
- ONNX Runtime: Cross-platform, optimized inference
- Ollama: Easy local model management (pre-configured in dev container)
- Apple MLX: Optimized for Apple Silicon
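For the Ollama route, a minimal sketch against Ollama's local HTTP API (default port 11434, non-streaming `/api/generate` endpoint). The `phi3` tag matches the model the dev container pulls, and the helper names are mine:

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"  # Ollama's default endpoint

def build_request(prompt, model="phi3"):
    # "phi3" is the tag pulled in the dev container; substitute other variants.
    return json.dumps({"model": model, "prompt": prompt, "stream": False}).encode()

def ask_ollama(prompt):
    """Send one non-streaming generation request to a local Ollama server."""
    req = urllib.request.Request(
        OLLAMA_URL,
        data=build_request(prompt),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]
```

Requires a running `ollama serve` with the model pulled (`ollama pull phi3`).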
Memory Issues:
- Phi models require significant RAM (especially vision/multimodal variants)
- Use quantized models for resource-constrained environments
- See `/md/01.Introduction/04/QuantifyingPhi.md`
Dependency Conflicts:
- Python examples may have specific version requirements
- Use virtual environments for each example
- Check individual `requirements.txt` files
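The per-example isolation can be sketched as follows (run from inside the example directory; `.venv` is an arbitrary name):

```shell
# One virtual environment per example keeps each sample's
# version pins from clashing with the others.
python3 -m venv .venv
. .venv/bin/activate
if [ -f requirements.txt ]; then
    pip install -r requirements.txt
fi
```

Deactivate with `deactivate` before switching to another example.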
Model Download Failures:
- Large models may timeout on slow connections
- Consider using cloud environments (Codespaces, Azure)
- Check the Hugging Face cache: `~/.cache/huggingface/`
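To see how much the cache is holding before clearing or re-downloading, a small stdlib-only helper (the function name is mine; the `huggingface_hub` package also ships a `scan_cache_dir` utility):

```python
import os

def dir_size_bytes(path=os.path.expanduser("~/.cache/huggingface")):
    """Total on-disk size of the Hugging Face cache (or any directory)."""
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            fp = os.path.join(root, name)
            if not os.path.islink(fp):  # skip symlinks used by the hub cache
                total += os.path.getsize(fp)
    return total

# print(f"{dir_size_bytes() / 1e9:.1f} GB")
```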
.NET Project Issues:
- Ensure .NET 8.0 SDK is installed
- Use `dotnet restore` before building
- Some projects have CUDA-specific configurations (Debug_Cuda)
JavaScript/Web Examples:
- Use Node.js 18+ for compatibility
- Clear `node_modules` and reinstall if issues persist
- Check the browser console for WebGPU compatibility issues
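A quick probe for the WebGPU issue, pasted into the browser's dev-tools console (the variable name is illustrative):

```javascript
// navigator.gpu is only defined in WebGPU-capable browsers
// (e.g. recent Chrome/Edge); Firefox/Safari support varies.
const hasWebGPU = typeof navigator !== "undefined" && "gpu" in navigator;
console.log(hasWebGPU ? "WebGPU is available" : "WebGPU is NOT available here");
```

If it reports unavailable, run the demo in a WebGPU-enabled browser rather than debugging the sample itself.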
- Discord: Join the Microsoft Foundry Community Discord
- GitHub Issues: Report bugs and issues in the repository
- GitHub Discussions: Ask questions and share knowledge
All Phi model usage should follow Microsoft's Responsible AI principles:
- Fairness, reliability, safety
- Privacy and security
- Inclusiveness, transparency, accountability
- Use Azure AI Content Safety for production applications
- See `/md/01.Introduction/01/01.AISafety.md`
- 50+ languages supported via automated GitHub Action
- Translations live in the `/translations/` directory
- Maintained by the co-op-translator workflow
- Do not manually edit translated files (auto-generated)
- Follow guidelines in `CONTRIBUTING.md`
- Agree to the Contributor License Agreement (CLA)
- Adhere to Microsoft Open Source Code of Conduct
- Keep security and credentials out of commits
This is a polyglot repository with examples in:
- Python - ML/AI workflows, Jupyter notebooks, fine-tuning
- C#/.NET - Enterprise applications, ONNX Runtime integration
- JavaScript - Web-based AI, browser inference with WebGPU
Choose the language that best fits your use case and deployment target.