Interactive web interface for the Inference Lab LLM inference simulator. Visualize and analyze LLM serving performance in real-time.
- Interactive Configuration: Edit simulation parameters in a visual editor
- Real-time Visualization: Watch simulations run with live charts
- Performance Metrics: View TTFT, E2E latency, throughput, and utilization
- Time Series Charts: Track metrics over simulation time
- Latency Distribution: Visualize P50, P90, P95, P99 percentiles
- Export Results: Download simulation data and configurations
npm install
npm run devOpen http://localhost:5173 in your browser.
npm run build
npm run previewThe production build will be in the dist/ directory.
This project depends on the @doubleword/inference-lab package, which provides the WebAssembly simulator core.
{
"dependencies": {
"@doubleword/inference-lab": "^0.1.0"
}
}- Load or Edit Configuration: Use the built-in editor to customize hardware, model, scheduler, and workload settings
- Run Simulation: Click "Run Simulation" and watch real-time progress
- Adjust Speed: Use the speed slider (1x, 10x, 100x, MAX)
- View Results: Explore charts showing latency, throughput, and utilization
- Export Data: Download results for further analysis
- React 19: UI framework
- TypeScript: Type-safe development
- Vite: Fast build tool and dev server
- Chart.js: Interactive charts and visualizations
- WebAssembly: High-performance simulation engine
inference-lab-web/
├── src/
│ ├── App.tsx # Main application component
│ ├── simulator.worker.ts # Web Worker for background simulation
│ ├── App.css # Styles
│ └── main.tsx # Entry point
├── public/ # Static assets
├── index.html # HTML template
├── vite.config.ts # Vite configuration
└── package.json # Dependencies and scripts
npm run dev- Start development servernpm run build- Build for productionnpm run preview- Preview production buildnpm run lint- Run ESLint
This application uses WebAssembly and requires a modern browser with:
- WebAssembly support
- ES2020+ features
- Web Workers
Tested on Chrome, Firefox, Safari, and Edge.
- Inference Lab - Core simulator (Rust/WASM)
MIT