In this episode I sat down with Mikolaj (Miko) Czerkawski from Asterisk Labs to explore BetaEarth, an experimental open-source emulator trained on AlphaEarth Foundations' public embedding archive. AEF — released by Google and Google DeepMind as a global 10 m embedding product derived from a wide range of Earth-observation modalities — is what makes BetaEarth possible: its openness lets the community build lightweight independent emulators that approximate AEF's pixelwise outputs from standard Sentinel inputs, and use them to probe how much of a model's behaviour is captured in its public embeddings. Miko walks through BetaEarth's design — compact architectures based on SegFormer-B2 with separate per-modality encoders, and a shared DINOv3 backbone over 3-band spectral primitives — and the surprising finding that reasonably strong approximations can be achieved even from simple RGB inputs.
We then dive into a live demo: generating BetaEarth embeddings for arbitrary regions and time ranges using Sentinel-1, Sentinel-2, and COP-DEM data. Along the way, we cover practical considerations such as cloud contamination, modality trade-offs, tiling artefacts, and strategies for merging multi-temporal signals. Finally, we discuss what this complementary tooling enables for the geospatial ML community — embeddings as pretraining or regularisation signals, lightweight local inference alongside AEF's global annual rasters, and what the combination of large proprietary archives and open emulator-style tools could unlock next.
Bio: Miko is a researcher specialising in AI, computer vision, signal processing and Earth observation. Before co-founding Asterisk Labs he was a postdoctoral research fellow at the European Space Agency. His research interests include data-centric analyses of large-scale Earth observation data, dataset curation, generative modelling, and restoration tasks for satellite imagery. He is a co-founder of the Major TOM community project, a platform for collaborating and reusing Earth observation datasets designed specifically for AI pipelines. He received the B.Eng. degree in electronic and electrical engineering in 2019 from the University of Strathclyde in Glasgow, United Kingdom, and the Ph.D. degree in 2023 at the same institution, specialising in applications of computer vision to Earth observation data.











