Paper: ParaStudent: Generating and Evaluating Realistic Student Code by Teaching LLMs to Struggle
Authors: Mihran Miroyan*, Rose Niousha*, Joseph E. Gonzalez, Gireeja Ranade, Narges Norouzi (UC Berkeley)
TL;DR: We study student modeling by generating student-like code submissions. We find that fine-tuning is essential for "unlearning" professional code style and learning to code like a student, and that careful evaluation is needed to capture the different facets of student-like code.
- Evaluation metrics. We introduce a set of metrics spanning code semantics, error types, and code style to evaluate the realism of "student-like" code.
- Sequential code modeling. We fine-tune on low- and high-resolution streams of student code to simulate realistic learning trajectories at different levels of granularity.
- Fine-tuning vs. prompting. Models fine-tuned on student code for specific homework problems outperform prompting-only models across the proposed metrics.
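To make the metric categories above concrete, here is a toy sketch of two of them (error type and code style) for Python submissions. The function names and heuristics are hypothetical stand-ins for illustration only; the paper's actual metric implementations live in evals/ and may differ substantially.

```python
# Hypothetical illustration of "error type" and "code style" signals for a
# student submission. Not the paper's implementation; see evals/ for that.
import ast


def error_type(code: str) -> str:
    """Classify a submission's coarsest failure mode: does it even parse?"""
    try:
        ast.parse(code)
        return "no_syntax_error"
    except SyntaxError:
        return "syntax_error"


def style_features(code: str) -> dict:
    """Toy style signals: average line length and comment density."""
    lines = [line for line in code.splitlines() if line.strip()]
    if not lines:
        return {"avg_line_len": 0.0, "comment_ratio": 0.0}
    avg_len = sum(len(line) for line in lines) / len(lines)
    comments = sum(1 for line in lines if line.lstrip().startswith("#"))
    return {"avg_line_len": avg_len, "comment_ratio": comments / len(lines)}


# A typical student-like slip: a typo in a keyword yields a syntax error.
student_like = "def f(x):\n  retrun x+1"
print(error_type(student_like))  # syntax_error
```

A realism evaluation would then compare the distribution of such signals between generated and real student submissions, rather than scoring a single snippet in isolation.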
Please read our paper for more details. This repo contains the fine-tuning, generation, and evaluation code. We do not release the real or synthetic data, or our data-processing pipeline, due to student privacy.
Install all necessary dependencies: `pip3 install -r requirements.txt`
fine_tuning/: Fine-tuning prompt templates and the training script (see more details in fine_tuning/README.md).
data_generation/: Prompt templates and script for data generation (see more details in data_generation/README.md).
evals/: Scripts for running evaluations, computing metrics, and visualizing results (see more details in evals/README.md).
@misc{miroyan2025parastudentgeneratingevaluatingrealistic,
  title={ParaStudent: Generating and Evaluating Realistic Student Code by Teaching LLMs to Struggle},
  author={Mihran Miroyan and Rose Niousha and Joseph E. Gonzalez and Gireeja Ranade and Narges Norouzi},
  year={2025},
  eprint={2507.12674},
  archivePrefix={arXiv},
  primaryClass={cs.CY},
  url={https://arxiv.org/abs/2507.12674},
}
