Skip to content

ethanrackleff/Classifying_Mathematical_Information_with_LLMs

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

COSC 426 Final Project:

This repository contains our:

  • sample data
  • config files for models to execute steps
  • bash scripts for submitting jobs to turing

This README contains instructions for running our experiment on sample data. For the purpose of our proposal, these steps are all run on the same data set. In our final project, we will have separate datasets for our training, validation, and evaluation. We also provide config files and bash scripts to run our experiment with a single model, however we will be performing this using more than one in our final product.

Note: File paths in this sample example were written to run on turing with file path based on our local setup file structure. File paths in config files and bash files may need to be reconfigured since files are all in this one folder on this repository.

Step 1: Train models using the train and validation data set

Submit training job to Turing. Run the train config file using this command:

qsub bashscript_finalproject_train.pbs

This command will run this pbs file that executes our main.py function with the "train" config file. This will give us the fine-tuned model in the path specified in the config file.

Step 2: Use TextClassification models on the evaluation data set

Run the train config file using this command:

qsub bashscript_finalproject_evaluate.pbs

This command will run this pbs file that executes our main.py function with the "evaluate" config file. This will generate a predictions file that we will use in Step 3

Step 3: Analyze output from model evaluation

Run the train config file using this command:

qsub bashscript_finalproject_evaluate.pbs

After this step, we should have our results data that is ready for our own analysis

Note: NLP Scholar toolkit may not be programmed to properly run analyze at this time. If this is the case, and this step is unable to produce automatic analysis, we will do those calculations/assessments with an alternate method.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors