- cross-posted to:
- datahoarder@lemmy.world
- cross-posted to:
- datahoarder@lemmy.world
cross-posted from: https://lemmy.world/post/39027950
I’ve been running OCR on the recent house epstein email dump. Making this available now that its close to finishing (20k/ 23k emails processed).
Processing script available here: https://codeberg.org/sillyhonu/Image_OCR_Processing_Epstein
I also put an analysis script in there if you want to use drive/ colab.
Currently finished files are available here:
So Lawrence m krauss is mentioned a lot? Lots of discussion about physics?
I’m watching it process the last two batches now.
I think they photographed an entire textbook.
For example, this just went through my console:
<|ref|>text<|/ref|><|det|>[[149, 85, 884, 157]]<|/det|> topological dimension as that of a line equal to one. If each time step had the largest up or down amplitude as possible, its fractal dimension would approach (but not reach) that of the embedding plane, Euclidean (d = 2) .
<|ref|>text<|/ref|><|det|>[[147, 169, 886, 680]]<|/det|> The (D_0) of the one dimensional Richardson technique (Mandelbrot, 1967) can be computed by covering the one dimensional surface of a time series with a number, #, of line segments of several orders of magnitude range of lengths, / . Graphing (\log (l)) along the (x) - axis and (\log # (l)) along the (y) - axis yields a negative linear slope, - s. As defined, (1 - s = D_0) noting that ((- (- s)\rightarrow +s)) such that (1< D_0 = 1 + s< 2) Strain differences and peptide and psychotropic drug- induced changes in (D_0) computed in this way were found in time series of fluctuations in rat brainstem tyrosine and tryptophan hydroxylase activities under far- from- equilibrium co- reactant concentrations (Mandell and Russo, 1981; Knapp et al, 1981; Knapp and Mandell, 1983; 1984). Systematic influences of stimulant drug dose on (D_0) were found as well in these systems (Mandell et al, 1982). This simple measure, made directly on the “roughness” of the graph of a one dimensional time series rather than on its orbital reconstruction, has been used to discriminate the pattern of fluctuations in daily mood scales in normal subjects and mood disordered patients (Woyshville et al, 1999). These findings confirmed dimensional scaling exponents on higher dimensional embeddings of similar time series in mood disordered patients (Gottschalk et al, 1995; Pezard et al, 1996). Due to the ease and rapidity of its computation, techniques involving (D_0) on one dimensional time series are currently in development as possible real time epilepsy predictors when analyzing the output of a large number of EEG leads simultaneously.
<|ref|>text<|/ref|><|det|>[[148, 692, 886, 876]]<|/det|> If (M(\epsilon)) is the minimum number of d- dimensional cubes of side (\epsilon) required to cover the d- dimensionally embedded attractor, plotting a logarithmic range of rulers of length (\epsilon) (as (\epsilon \rightarrow 0) ) along the (x) axis and a logarithmic range of number of cubes, (M(\epsilon)) , each of corresponding (\epsilon) - edge size, along the (y) axis, results in a negative (more smaller (M(\epsilon)) 's and fewer bigger (M(\epsilon)) 's) power law slope (D_0) . Here the numbered covering cubes, (M(\epsilon)) , are those in which the probability of containing at least one point (its “probability density measure,” often called (\mu) ) is not zero. We
This isn’t physics though, it reads like complex systems analysis…
and in fact I think I might even have the text book this is from…
I read an article about their correspondence. Epstein even helped Krauss with his sexual misconduct allegations which is apparently one of the reason Krauss retired.
I mean you can download the db posted and search.

