image credit: Pixabay

Amazon begins shifting Alexa’s cloud AI to its own silicon

November 13, 2020

Via: ArsTechnica

Category: Global Cloud

On Thursday, an AWS blog post announced that Amazon has moved most of the cloud processing for its Alexa personal assistant off Nvidia GPUs and onto its own Inferentia application-specific integrated circuit (ASIC). Amazon developer Sebastien Stormacq describes Inferentia’s hardware design as follows:

AWS Inferentia is a custom chip, built by AWS, to accelerate machine learning inference workloads and optimize their cost. Each AWS Inferentia chip contains four NeuronCores. Each NeuronCore implements a high-performance systolic array matrix multiply engine, which massively speeds up typical deep learning operations such as convolution and transformers.
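
The description above mentions a systolic array matrix multiply engine. As a rough illustration of that general idea only, the Python sketch below simulates a toy output-stationary systolic array in software: each cell (i, j) of a grid accumulates one output value, and operands reach it on a skewed schedule, one multiply-accumulate per step. The function name `systolic_matmul`, the choice of an output-stationary dataflow, and the step count are assumptions made for illustration. None of it is taken from AWS, and it does not claim to reflect how a NeuronCore is actually built.

```python
def systolic_matmul(a, b):
    """Toy software model of an output-stationary systolic array.

    a is an n x k matrix and b is a k x m matrix, both as lists of lists.
    Returns (c, steps): the n x m product and the number of simulated
    time steps. This is an illustrative model, not real hardware behavior.
    """
    n, k, m = len(a), len(b), len(b[0])
    if any(len(row) != k for row in a):
        raise ValueError("inner dimensions of a and b must match")

    # One accumulator per processing element (PE), one PE per output cell.
    acc = [[0] * m for _ in range(n)]

    # Operands are skewed: PE (i, j) sees the s-th pair of operands at
    # time t = s + i + j, so the last PE finishes at t = n + m + k - 3.
    steps = n + m + k - 2
    for t in range(steps):
        for i in range(n):
            for j in range(m):
                s = t - i - j  # which operand pair reaches PE (i, j) now
                if 0 <= s < k:
                    acc[i][j] += a[i][s] * b[s][j]
    return acc, steps
```

In hardware, all cells would do their multiply-accumulate in the same clock cycle, which is where the speedup over a sequential loop would come from. The nested loops here only imitate that schedule one cell at a time.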

