Published 7/16/2024
Using HuggingFace Datasets Offline
How to save a HuggingFace dataset to disk and use it offline
This is pretty simple, but quite helpful if you're running jobs on a compute node that doesn't have internet access.
On the login node or another machine with internet access, run the following Python code:
Then, if needed, copy the files to the machine running your job. Now, from that offline machine, loading the dataset is simple!