Image 1

Webpage for the University of Chicago Data Science Clinic

Hosted on GitHub Pages — Theme by orderedlist

Removing sensitive data from git

This doc is a WIP and needs to be cleaned up

Every time I do this it’s kind of an ordeal (and there’s a few different tools). Here’s what I did last time — could maybe be added to Clinic docs or wherever this sort of thing belongs.

Ok, this is always a nightmare but here’s how you use the BFG repo cleaner:

git clone --mirror https://github.com/your/repo
java -jar ~/bfg-1.14.0.jar --delete-files '*.csv'
git config http.postBuffer 524288000
! [remote rejected] dev -> dev (protected branch hook declined)
+ 74bca8b...eff0c7b dev -> dev (forced update)
+ 74bca8b...307119b main -> main (forced update)