In the README under "Getting Dataset" > 2. A directory containing the .html files (processed .tex files by LaTeXML) with the same folder structure How do I get .tex files from pdfs?
In the README under "Getting Dataset"
How do I get .tex files from pdfs?