NOTES Field notes from a senior data analyst
Notes in the margins.
Shorter pieces. Three a week. Data, ML, tools, the industry. The labs are the marquee form on this site. These are everything that doesn’t fit there. Subscribe via RSS, or check back from the homepage.
-
33 OG images in 80 lines of Python
Per-page OpenGraph cards without Figma. A one-shot Python script over your existing og:title tags.
-
The k=2 problem in cluster selection
Silhouette at k=2 is almost always highest and almost always meaningless. Why my segmentation lab grays it out instead of pretending it’s a real choice.
-
Why my A/B simulator says false positives are 26%, not 5%
Peeking every 50 users turns a 5% nominal false-positive rate into a 26% empirical one. Monte Carlo results from the A/B Test Simulator lab.
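
A minimal sketch of the peeking effect, not the lab's actual code. It runs A/A tests (both arms share the same true rate, so every "significant" result is a false positive) and compares a single test at the end against a test at every look. The names `false_positive_rates` and `z_stat` are hypothetical. The 10% base rate, the 1,000-user horizon, the pooled two-proportion z-test, and reading "every 50 users" as every 50 users per arm are all assumptions.

```python
import math
import random

Z_CRIT = 1.959964  # two-sided 5% critical value for a z-test


def z_stat(conv_a, conv_b, n):
    """Pooled two-proportion z statistic for two arms of equal size n."""
    pooled = (conv_a + conv_b) / (2 * n)
    var = pooled * (1 - pooled) * 2 / n
    if var == 0:
        return 0.0  # no variation yet (all or no conversions), nothing to test
    return (conv_a / n - conv_b / n) / math.sqrt(var)


def false_positive_rates(n_sims=1000, n_per_arm=1000, peek_every=50,
                         base_rate=0.10, seed=0):
    """Return (peeking_rate, fixed_rate) over simulated A/A tests.

    peeking_rate: share of runs where ANY look crossed the threshold.
    fixed_rate:   share of runs where only the final look is tested.
    All defaults are illustrative assumptions.
    """
    rng = random.Random(seed)
    peeking_hits = 0
    fixed_hits = 0
    for _ in range(n_sims):
        conv_a = conv_b = 0
        flagged = False
        for i in range(1, n_per_arm + 1):
            # Both arms draw from the same rate: there is no real effect.
            conv_a += rng.random() < base_rate
            conv_b += rng.random() < base_rate
            if not flagged and i % peek_every == 0:
                # A peeker would stop and declare a winner here.
                flagged = abs(z_stat(conv_a, conv_b, i)) > Z_CRIT
        peeking_hits += flagged
        fixed_hits += abs(z_stat(conv_a, conv_b, n_per_arm)) > Z_CRIT
    return peeking_hits / n_sims, fixed_hits / n_sims


if __name__ == "__main__":
    peeking, fixed = false_positive_rates()
    print(f"fixed horizon: {fixed:.1%}   with peeking: {peeking:.1%}")
```

The inflated rate depends on the horizon and on how many looks you take, so this sketch will not necessarily reproduce the 26% figure. The fixed-horizon rate should stay near the nominal 5%.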