COSC 426 Final Project:

This repository contains our:

sample data
config files for models to execute steps
bash scripts for submitting jobs to turing

This README contains instructions for running our experiment on sample data. For the purpose of our proposal, these steps are all run on the same data set. In our final project, we will have separate datasets for our training, validation, and evaluation. We also provide config files and bash scripts to run our experiment with a single model, however we will be performing this using more than one in our final product.

Note: File paths in this sample example were written to run on turing with file path based on our local setup file structure. File paths in config files and bash files may need to be reconfigured since files are all in this one folder on this repository.

Step 1: Train models using the train and validation data set

Submit training job to Turing. Run the train config file using this command:

qsub bashscript_finalproject_train.pbs

This command will run this pbs file that executes our main.py function with the "train" config file. This will give us the fine-tuned model in the path specified in the config file.

Step 2: Use TextClassification models on the evaluation data set

Run the train config file using this command:

qsub bashscript_finalproject_evaluate.pbs

This command will run this pbs file that executes our main.py function with the "evaluate" config file. This will generate a predictions file that we will use in Step 3

Step 3: Analyze output from model evaluation

Run the train config file using this command:

qsub bashscript_finalproject_evaluate.pbs

After this step, we should have our results data that is ready for our own analysis

Note: NLP Scholar toolkit may not be programmed to properly run analyze at this time. If this is the case, and this step is unable to produce automatic analysis, we will do those calculations/assessments with an alternate method.

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
Avg Calc.py		Avg Calc.py
Evaluation Metrics README (1).md		Evaluation Metrics README (1).md
README.md		README.md
bashscript_finalproject_analyze.pbs		bashscript_finalproject_analyze.pbs
bashscript_finalproject_evaluate.pbs		bashscript_finalproject_evaluate.pbs
bashscript_finalproject_train.pbs		bashscript_finalproject_train.pbs
config_finalproject_sample_analyze.yaml		config_finalproject_sample_analyze.yaml
config_finalproject_sample_evaluate.yaml		config_finalproject_sample_evaluate.yaml
config_finalproject_sample_train.yaml		config_finalproject_sample_train.yaml
finalproject_sample_dataset.tsv		finalproject_sample_dataset.tsv
main.py		main.py
predictions.tsv		predictions.tsv
sampleTrainData.txt		sampleTrainData.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

COSC 426 Final Project:

Step 1: Train models using the train and validation data set

Step 2: Use TextClassification models on the evaluation data set

Step 3: Analyze output from model evaluation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

COSC 426 Final Project:

Step 1: Train models using the train and validation data set

Step 2: Use TextClassification models on the evaluation data set

Step 3: Analyze output from model evaluation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages