Posts

Thoughts on web development, AI, and building things that matter.

#ml/ai (12)#open-source (4)#random (4)#frontend (3)#linguistics (3)

Jul 1, 20253 min read

Dialects for Humans: Sounding Distinct from LLMs

Humans are developing new linguistic patterns to distinguish themselves from AI-generated content, and the rate of change will accelerate.

#ml/ai#linguistics

Jul 16, 20241 min read

Using HuggingFace Datasets Offline

How to save a HuggingFace dataset to disk and use it offline

#ml/ai

Jul 16, 20241 min read

Tips #1

Markdown detection in Google Docs, swiping between tabs in Brave Browser for iOS, and running TypeScript files from the command line.

#random#ml/ai

Nov 7, 20235 min read

Rebuilding Alpaca with the Hugging Face Trainer Class

Fine-tuning Llama-2-7B using the Alpaca dataset and Hugging Face Trainer

#ml/ai#open-source

Oct 16, 20232 min read

Introducing gom: GPU Monitoring across Containers

I published `gom`, a CLI tool for monitoring GPU usage across Docker containers.

#ml/ai#open-source

Sep 11, 20234 min read

Enroot on Slurm for Distributed ML: Part 2

How to use Enroot on Slurm for containerized multi-node training.

#ml/ai

Sep 8, 20232 min read

Enroot on Slurm for Distributed ML: Part 1

How to use Enroot on Slurm for containerized multi-node training.

#ml/ai

Sep 8, 20232 min read

Quick & Helpful Slurm Commands

A quick guide to using Slurm for distributed machine learning.

#ml/ai

Sep 8, 20232 min read

Setting Up Docker for Machine Learning

The Dockerfile I use to set up my machine learning environment.

#ml/ai

Aug 29, 20233 min read

Accelerate vs. DeepSpeed vs. FSDP

Which one should you use for distributed training?

#ml/ai

Aug 23, 20232 min read

LLMs Will Never Be Able to Do (Complicated) Math

Since contemporary LLM architectures lack recursion, they're fundamentally incapable of doing some math operations.

#ml/ai

Ben Gubler

Dialects for Humans: Sounding Distinct from LLMs

Using HuggingFace Datasets Offline

Tips #1

Rebuilding Alpaca with the Hugging Face Trainer Class

Introducing gom: GPU Monitoring across Containers

Enroot on Slurm for Distributed ML: Part 2

Enroot on Slurm for Distributed ML: Part 1

Quick & Helpful Slurm Commands

Setting Up Docker for Machine Learning

Accelerate vs. DeepSpeed vs. FSDP

LLMs Will Never Be Able to Do (Complicated) Math