Merge the files from the papers in the PeerRead dataset into a single file. Note that only entries that have ratings in their reviews will be used.
- Download the dataset from Google Drive.
$ uv run paper peerread download data/PeerRead- Preprocess the dataset.
$ uv run paper peerread preprocess data/PeerRead output/peerread_merged.jsonThe paths can be changed via CLI arguments. See peerread --help for more information.