IBM has released Granite 4.0 3B Vision, a lightweight multimodal AI model designed specifically for enterprise document handling and analysis. The compact 3 billion parameter model combines vision and language capabilities in a footprint small enough for deployment on-premises or in resource-constrained environments, addressing a critical need among organizations seeking to automate document processing without the computational overhead of larger models. The release represents a significant development in the enterprise AI space, where organizations increasingly seek efficient alternatives to massive foundation models for specialized tasks. By focusing on document understanding—a core workflow across finance, healthcare, legal, and insurance sectors—Granite 4.0 3B Vision targets the practical concerns of cost-conscious enterprises while maintaining competitive performance for its size class. The model's design emphasizes performance per parameter, making it accessible to a broader range of organizations than previous generations.