IBM releases Granite 4.0 with a new hybrid Mamba-2/transformer architecture: significantly reduces memory usage without sacrificing performance
IBM just released Granite 4.0, an open-source LLM family that replaces the pure transformer design with a hybrid Mamba-2/Transformer stack to cut memory use while maintaining quality. Sizes span the 3B dense “Micro”, 3B hybrid...