Close Menu
Versa AI hub
  • AI Ethics
  • AI Legislation
  • Business
  • Cybersecurity
  • Media and Entertainment
  • Content Creation
  • Art Generation
  • Research
  • Tools
  • Resources

Subscribe to Updates

Subscribe to our newsletter and stay updated with the latest news and exclusive offers.

What's Hot

How to easily sign PDFs online using PDF Signer

June 10, 2026

How agents built a 3D Paris gallery by chaining together two hugging face spaces

June 9, 2026

Gemini 3.5: Frontier Intelligence with Action

June 9, 2026
Facebook X (Twitter) Instagram
Versa AI hubVersa AI hub
Wednesday, June 10
Facebook X (Twitter) Instagram
Login
  • AI Ethics
  • AI Legislation
  • Business
  • Cybersecurity
  • Media and Entertainment
  • Content Creation
  • Art Generation
  • Research
  • Tools
  • Resources
Versa AI hub
Home»Tools»A family of more efficient Earth observation models
Tools

A family of more efficient Earth observation models

versatileaiBy versatileaiMay 21, 2026No Comments5 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
#image_title
Share
Facebook Twitter LinkedIn Pinterest Email

🧠 Model: https://huggingface.co/collections/allenai/olmoearth | 📄 Technical report: https://allenai.org/papers/olmoearth_v1_1 | 💻 Code: https://github.com/allenai/olmoearth_pretrain

OlmoEarth v11 blog and social copy - Google Docs-image-1

We released OlmoEarth (v1) in November 2025. Since then, partners have applied OlmoEarth (v1) to a wide range of tasks, from tracking mangrove change to classifying the causes of forest loss, to creating country-scale crop type maps in days, to expanding deployment to countries, continents, and global regions. Each release brings us closer to our mission of providing cutting-edge AI to organizations and communities working to protect people and the planet.

Efficiency shapes what’s possible when OlmoEarth processes satellite imagery to make predictions that range from tens of thousands of square kilometers to hundreds of thousands of square kilometers. Throughout OlmoEarth’s execution lifecycle (data export, preprocessing, inference, and postprocessing), the compute costs are overwhelming. A more efficient model means we can support more partners on the OlmoEarth platform, and anyone running OlmoEarth themselves can take advantage of this technology faster and at lower cost.

That’s why we built OlmoEarth v1.1. This is a new family of models that maintains the performance of OlmoEarth v1 on a combination of research benchmarks and tasks built with our partners while reducing compute costs by up to 3x.

Reduce sequence length and increase efficiency

OlmoEarth models are transformer-based models and are one of the leading architectures in machine learning today. To process remote sensing data, first convert it into a sequence of tokens that the model can ingest.

Two important factors that control the efficiency of transformer-based models are model size (this is why we release families of models so that users can choose the size that fits their computing budget) and the length of the token sequence. Computational costs increase quadratically with the length of the token sequence, so even small reductions can significantly reduce model execution costs.

bench-capture-2026-05-18T14-40-39

MAC, or multiply-accumulate operation, estimates the computations required for one forward pass of the model. In general, a lower MAC means cheaper and faster inference. The y-axis is inverted because the lower the average rank, the better. The label shows the model family and size. All plotted points use the pasted MAC/Rank values.

Token design

This raises an important question: what should the token represent for a transformer-based remote sensing model?

Consider Sentinel-2 images, a common modality that we process. The Sentinel-2 input will be a tensor with height and width (H, W represent latitude and longitude pixels), time dimension T, and 12 Sentinel-2 channels ((H, W, T, D=12)).

OlmoEarth v11 Blog Copy - Google Docs-image-3

We are currently splitting the data into resolution-based patches. Specifically, this means choosing a spatial patch size p and dividing the entire Sentinel-2 image into patches of size pxp.

OlmoEarth v11 blog and social copy - Google Docs-image-4

Create tokens for each patch, each timestep, and each resolution. Therefore, a Sentinel-2 input with 2 timesteps will produce 6 tokens per patch (2 timesteps x 3 resolutions, 10m, 20m, and 60m).

In total, a(H, W, T, D=12) Sentinel-2 input produces H/px W/px T x 3 tokens.

When processing Sentinel-2 data, a common practice is to use unique tokens for each resolution. Galileo and SatMAE both take this approach, and SatMAE shows significantly better results when it does. However, this is not universal. CROMA is a model that uses only one token for all bands, regardless of resolution. The number of tokens increases multiplicatively, so collapsing the resolution to a single token reduces the tokens by a factor of three, saving material across pre-training, fine-tuning, and inference.

Simply combining tokens in this way leads to significant performance degradation, such as a 10 ppt drop on m-eurosat kNN (a common benchmark task for remote sensing models). We hypothesize that separating Sentinel-2 bands into different tokens will make it easier for OlmoEarth to model important cross-band relationships.

We had to change the pre-training plan to combine tokens without impacting performance. These changes are detailed in the paper.

For developers

The result is a family of models that do more with less. At any size, OlmoEarth v1.1 runs up to 3x cheaper than OlmoEarth v1, making frequent planet-scale map updates more affordable for all teams running OlmoEarth. If you are using models from the original OlmoEarth family, try OlmoEarth v1.1. It provides similar performance to OlmoEarth v1 while requiring a third of the compute, but with some setbacks (see technical report for details). If it works well for your task, you should see significant speedups during fine-tuning and inference.

For researchers

Pre-trained remote sensing models have many degrees of freedom, making them difficult to study. When performance changes, is it the architecture, the dataset, or the pre-training algorithm?

Because we are training OlmoEarth v1.1 on the same dataset as OlmoEarth v1, the two differences separate the impact of the methodology change. We hope this will advance our understanding of the scientific principles behind pre-training models for remote sensing.

Let’s get started

Check out the weights and training code for OlmoEarth v1.1, including weights for Base, Tiny, and Nano models.

author avatar
versatileai
See Full Bio
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleEnterprise AI Obstacles and Roadmap, Security and Physical AI: TechEx Day 2
Next Article Nvidia Vera chips, Huang opens up second front and aims for $200 billion market
versatileai

Related Posts

Tools

How to easily sign PDFs online using PDF Signer

June 10, 2026
Tools

How agents built a 3D Paris gallery by chaining together two hugging face spaces

June 9, 2026
Tools

Gemini 3.5: Frontier Intelligence with Action

June 9, 2026
Add A Comment

Comments are closed.

Top Posts

Switzerland releases its own completely open AI model

September 4, 2025138 Views

Data and AI Status: Security and Privacy

July 12, 202588 Views

The Colorado AI Act was delayed until June 2026

September 21, 202571 Views
Stay In Touch
  • YouTube
  • TikTok
  • Twitter
  • Instagram
  • Threads
Latest Reviews

Subscribe to Updates

Subscribe to our newsletter and stay updated with the latest news and exclusive offers.

Most Popular

Switzerland releases its own completely open AI model

September 4, 2025138 Views

Data and AI Status: Security and Privacy

July 12, 202588 Views

The Colorado AI Act was delayed until June 2026

September 21, 202571 Views
Don't Miss

How to easily sign PDFs online using PDF Signer

June 10, 2026

How agents built a 3D Paris gallery by chaining together two hugging face spaces

June 9, 2026

Gemini 3.5: Frontier Intelligence with Action

June 9, 2026
Service Area
X (Twitter) Instagram YouTube TikTok Threads RSS
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer
© 2026 Versa AI Hub. All Rights Reserved.

Type above and press Enter to search. Press Esc to cancel.

Sign In or Register

Welcome Back!

Login to your account below.

Lost password?