One of the more surprising facts we learned from ARM when we sat down with them last year to discuss their Mali GPU architecture was just how successful the Mali-400 family was and is; not just in the past tense but in the present tense as well. Although ARM has been selling Mali T600/700/800 designs based on their Midgard architecture for a few years now, the Utgard based Mali-400/450 have continued to sell well due Utgard’s “no-frills” OpenGL ES 2.0 design. This has made the Mali-400/450 popular parts in low cost and low power environments, where a complete Mali design takes up a minimal amount of die space and similarly small amounts of power.
Though chasing the low-end of the market doesn’t come with the glamour of the high-end (and for that there’s Midgard), for ARM it has in many ways the textbook low-price/high-volume success story. The company’s estimates put them in plenty of low-end tablets and smartphones, but more unexpectedly they have a lion’s share of the smart TV market, with 75% of TVs implementing a Mali design. With TVs only requiring enough graphics power for basic UI functions and video decoding at 1080p – few smart TVs play games or do fancy transparency/animation effects – even after having been eclipsed by more powerful GPUs at the high-end, the Mali-400 family has found a second home in these sort of simple devices that need little more than a no-frills GPU, be it for cost reasons, power reasons, or both.
Given the continued success of the Mali-400 family it should come as little surprise then that ARM has continued to develop the family, as the need for OpenGL ES 2.0 won’t be coming to an end any time soon. To that end, today ARM is announcing the release of the next iteration of the Mali-400 family IP, the Mali-470. Mali-470 serves as something of a decedent of both Mali-400 and Mali-450, with ARM intending to make it a direct successor to Mali-400 while also integrating some of the improvements that original went in to Mali-450 as well.
With the Mali-470 ARM is specifically targeting the IoT and embedded market, even more so than they were already going after it with the Mali-400 and Mali-450. In recent times the Mali-450 has been ARM’s more powerful Mali-400 family GPU, specifically going after low-end devices and the aforementioned TV market. Meanwhile thanks in part to the introduction of smaller processes since its introduction in 2008, the Mali-400 has already been something of a darling for low-power applications, appearing in wearable devices such as Samsung’s Gear S2. ARM and their licensees continue to be bullish on wearables and IoT in general, and this has lead to ARM setting out to develop a more direct update to the Mali-400 to further improve on its energy efficiency.
Overall ARM is aiming to double Mali-400’s energy efficiency with Mali-470, in accordance with their goals to reach further into the wearables and IoT market. To do so, ARM has taken some of Mali-450’s energy optimizations and further built on them; chief among these changes are far more extensive use of power gating to shut off transistors when they’re not needed, and improved use of clock gating when power gating isn’t an option. In order to accomplish this ARM has implemented separate power domains on Mali-470 – each pixel/fragment block can be on its own domain – and working with these domains is an on-GPU power controller, which works with the system power controller to better manage the GPU.
Improving the use of power and clock gating can greatly impact overall energy efficiency by cutting down on the amount of energy wasted by the GPU waiting for its next job, but it’s especially potent for bringing down idle power consumption. In turn, given that idle power consumption continues to be one of the chief issues dogging wearables vis-a-vis battery life, the Mali-470 should help out battery life to a degree thanks to these improvements.
Along with these immediate power optimizations, ARM has also made some smaller optimizations to the Mali-400 family architecture to cut down on wasted power at a workload level. Mali-470’s vertex processor has been optimized to better handle fixed-point arithmetic, a use case that is again very common in simple UIs and can be even more power efficient than floating-point arithmetic. The pixel/fragment processor on the other hand has not gone untouched, with ARM working to cut down on the amount of time spent by the fragment processors changing between states.
Moving on, in terms of scalability Mali-470 retains the multi-processor capabilities of the previous Mali designs, allowing Mali-470 to be scaled up from 1 fragment processor (MP1) to as many as 4 (MP4). Mali-470MP1 will in turn be the intended design for wearables given its spot as the lowest-power design, while the more powerful designs will allow Mali-470 to be used in other environments where more performance is needed. That ARM would retain their scalability for Mali-470 is a very intentional choice; as we mentioned earlier Mali-470 is intended to be the successor to Mali-400, with ARM going so far as to keep Mali-400’s MMU and bus interfaces and holding Mali-470’s die area closer to Mali-400’s. This in turn means that Mali-470 needs to be able to carry on from where Mali-400 has left off, not only in the wearables/IoT market but in the embedded market as well.
Wrapping things up, as with past ARM IP launches today’s announcement coincides with the general release of new IP to ARM’s customers. ARM tells us that they expect chips integrating Mali-470 designs to start leaving the fabs in the second-half of 2016, which would in turn have Mali-470 start appearing in consumer devices in the first-half of 2017. Ultiamtely ARM doesn’t control which products use which IP, but given their intentions with Mali-470, there’s a good chance that those consumer devices will include Android/Tizen wearables launching early that year.