Back to news
backend Priority 4/5 7/1/2026, 11:05:15 AM

NAVI-Orbital Demonstrates First Onboard Zero-Shot Vision-Language Model on Low Earth Orbit Satellite

NAVI-Orbital Demonstrates First Onboard Zero-Shot Vision-Language Model on Low Earth Orbit Satellite

The rapid growth of Earth observation data has created a severe bottleneck due to limited downlink bandwidth and the manual overhead of ground-based processing. To address this, NAVI-Orbital shifts the processing paradigm by executing a vision-language model, specifically Gemma 3, directly on satellite-class edge computers. This architecture enables real-time scene classification, natural-language description generation, and interactive operator dialogue onboard the spacecraft. Orchestrated by a LangGraph-based state machine, the system coordinates dedicated agent workflows for object detection and dialogue. Operators can re-task the satellite using plain-English prompts instead of traditional command sequences. This edge capability allows the system to compress raw image data semantically, prioritizing critical information for downlink and drastically reducing the required transmission bandwidth. Validation of the system included ground benchmarking on the AID dataset, achieving an 88.16% accuracy rate, followed by successful live in-orbit execution using uncorrected imagery. The demonstration, conducted on April 16, 2026, proves the feasibility of running modern foundation models on edge-compute hardware in space without requiring instrument-specific fine-tuning.

Related tools

Recommended tools for this topic

These picks prioritize high-intent tools relevant to this topic. Some links may include partner or affiliate tracking.

#arxiv#research#ai#agent

Comparison

AspectBefore / AlternativeAfter / This
Data Processing LocationGround stations after transferring raw, high-bandwidth imagery downlinksOnboard edge computing to perform semantic compression prior to downlink
Satellite Re-taskingRigid, conventional hardware command sequencesPlain-English natural-language prompts via agent workflows
System OrchestrationProcedural embedded flight software routinesGraph-based state machine using LangGraph to coordinate agents
Model AdaptabilityHighly specialized models fine-tuned for specific flight instrumentsZero-shot foundation model (Gemma 3) requiring no instrument-specific fine-tuning

Source: arXiv

This page summarizes the original source. Check the source for full details.

Related