How Microsoft & IBM’s SLMs Are Making AI More Sustainable

Is bigger always better? As computational demands continue to escalate, tech giants IBM and Microsoft are pioneering a sustainable approach to artificial intelligence that challenges the long-held belief.
Their latest small language models (SLMs) represent a breakthrough in energy-efficient, cost-effective AI technology that promises to reshape how businesses and developers approach artificial intelligence.
Making sustainable AI
The traditional AI development race has been characterised by a relentless pursuit of ever-larger models, consuming massive computational resources and energy.
However, both IBM and Microsoft are now demonstrating that smaller, more focused AI models can deliver powerful performance while significantly reducing environmental and economic costs.
IBM's Granite: Compact and Capable
IBM's latest Granite 3.2 models epitomise this new approach. By focusing on compact systems designed for specific business applications, the company has developed models that:
- Reduce computational requirements by up to 30% in the Guardian safety models
- Process complex document understanding tasks with minimal resource consumption
- Offer optional "chain of thought" reasoning to optimise computational efficiency
The TinyTimeMixers models are capable of two-year forecasting with fewer than 10 million parameters – a stark contrast to the hundreds of billions of parameters in traditional large language models.
Microsoft's Phi-4: Multimodal Efficiency
Microsoft's Phi-4 family takes a similar approach, introducing two groundbreaking models:
- Phi-4-multimodal: A 5.6B parameter model that processes speech, vision, and text simultaneously
- Phi-4-mini: A 3.8B parameter model optimized for text-based tasks
These models are designed for compute-constrained environments, making them ideal for integration into smartphones, vehicles and other devices with limited computational resources.
“Phi-4-multimodal marks a new milestone in Microsoft’s AI development as our first multimodal language model,” says Weizhu Chen, Vice President, Generative AI at Microsoft.
“By leveraging advanced cross-modal learning techniques, this model enables more natural and context-aware interactions, allowing devices to understand and reason across multiple input modalities simultaneously.
“Whether interpreting spoken language, analysing images, or processing textual information, it delivers highly efficient, low-latency inference – all while optimising for on-device execution and reduced computational overhead.”
Beyond performance: A sustainable vision
Both companies emphasise that the future of AI isn't about raw computational power, but about efficiency, integration and real-world impact.
“The next era of AI is about efficiency, integration and real-world impact – where enterprises can achieve powerful outcomes without excessive spend on compute,” says Sriram Raghavan, Vice President of IBM AI Research.
The key sustainability benefits include.
- Reduced energy consumption: Smaller models require significantly less energy to train and operate.
- Lower carbon footprint: Decreased computational needs translate to reduced greenhouse gas emissions.
- Increased accessibility: More affordable AI solutions for smaller organizations
- Flexible deployment: Ability to run advanced AI on edge devices and in resource-constrained environments.
Explore the latest edition of Sustainability Magazine and be part of the conversation at our global conference series, Sustainability LIVE.
Discover all our upcoming events and secure your tickets today.
Sustainability Magazine is a BizClik brand


