Five years ago, DairyNZ relied on Excel spreadsheets and a small team of statisticians for its analytics. They were limited in their ability to handle large datasets effectively on local laptops. Insights were siloed, reproducibility was a challenge, and farmers had to wait 12–24 months for the release of economic surveys. In a sector where fertilizer, feed, and fuel prices can spike within a season, such delays limit the value of data for real-world farm decisions.
At the same time, new technologies were generating a “metric ton” of high-frequency data. Wearable sensors for thousands of cows (think Apple watches for cows), weather stations, and IoT devices began streaming 60 observations per second, demanding scalable infrastructure and modern workflows.
Traditional linear science—experiment, analysis, publish—could not keep up with the new scale and speed of data. DairyNZ needed “fit for purpose tools” that supported iterative, real-time analytics and made insights easily shareable with farmers and analysts alike. They needed to transition from having siloed R users to supporting an IT-backed data science team.