- cross-posted to:
- datahoarder@lemmy.world
- cross-posted to:
- datahoarder@lemmy.world
cross-posted from: https://lemmy.world/post/39027950
I’ve been running OCR on the recent house epstein email dump. Making this available now that its close to finishing (20k/ 23k emails processed).
Processing script available here: https://codeberg.org/sillyhonu/Image_OCR_Processing_Epstein
I also put an analysis script in there if you want to use drive/ colab.
Currently finished files are available here:

I’m watching it process the last two batches now.
I think they photographed an entire textbook.
For example, this just went through my console:
This isn’t physics though, it reads like complex systems analysis…
and in fact I think I might even have the text book this is from…
I read an article about their correspondence. Epstein even helped Krauss with his sexual misconduct allegations which is apparently one of the reason Krauss retired.
I mean you can download the db posted and search.