Machine Learning News Hubb
Advertisement Banner
  • Home
  • Machine Learning
  • Artificial Intelligence
  • Big Data
  • Deep Learning
  • Edge AI
  • Neural Network
  • Contact Us
  • Home
  • Machine Learning
  • Artificial Intelligence
  • Big Data
  • Deep Learning
  • Edge AI
  • Neural Network
  • Contact Us
Machine Learning News Hubb
No Result
View All Result
Home Edge AI

The Future of AI is Hybrid: How On-device AI is Enabling Generative AI to Scale

admin by admin
May 15, 2023
in Edge AI


This blog post was originally published at Qualcomm’s website. It is reprinted here with the permission of Qualcomm.

Providing cost savings as well as performance, personalization, privacy and security benefits

As generative artificial intelligence (AI) adoption grows at record-setting speeds1 and computing demands increase2, hybrid processing is more important than ever. But just like traditional computing evolved from mainframes and thin clients to today’s mix of cloud and edge devices, AI processing must be distributed between the cloud and devices for AI to scale and reach its full potential.

A hybrid AI architecture distributes and coordinates AI workloads among cloud and edge devices, rather than processing in the cloud alone. The cloud and edge devices — smartphones, cars, personal computers, and Internet of Things (IoT) devices — work together to deliver more powerful, efficient and highly optimized AI.

The main motivation is cost savings. For instance, generative AI-based search cost per query is estimated to increase by 10 times compared to traditional search methods3 — and this is just one of many generative AI applications.

Hybrid AI will allow generative AI developers and providers to take advantage of the compute capabilities available in edge devices to reduce costs. A hybrid AI architecture (or running AI on-device alone) offers the additional benefits of performance, personalization, privacy and security at a global scale.

These architectures can have different offload options to distribute processing among cloud and devices depending on factors such as model and query complexity. For example, if the model size, prompt and generation length is less than a certain threshold and provides acceptable accuracy, inference can run completely on the device. If the task is more complex, the model can run across cloud and devices.

Hybrid AI even allows for devices and cloud to run models concurrently — with devices running light versions of the model while the cloud processes multiple tokens of the full model in parallel and corrects the device answers if needed.


In a device-centric hybrid AI architecture, the cloud is only used to offload AI tasks that the device cannot sufficiently perform.

Scaling generative AI with edge devices

The potential of hybrid AI grows further as powerful generative AI models become smaller while on-device processing capabilities continue to improve. AI models with more than 1 billion parameters are already running on phones with performance and accuracy levels similar to those of the cloud, and models with 10 billion parameters or more are slated to run on devices in the near future.

The hybrid AI approach is applicable to virtually all generative AI applications and device segments — including phones, laptops, extended reality headsets, cars and IoT. The approach is crucial for generative AI to scale and meet enterprise and consumer needs globally. We truly believe that the future of AI is hybrid. Read our whitepaper to learn more.

For more information

References

Ziad Asghar
SVP, Product Management, Qualcomm Technologies

Dr. Jilei Hou
Vice President, Engineering, Qualcomm Technologies





Source link

Previous Post

3 Free Platforms for Personalized ChatGPT Experience

Next Post

roadmap to learn data science for fresher

Next Post

roadmap to learn data science for fresher

Demand forecasting at Getir built with Amazon Forecast

Reducing SaaS spend in 2023

Related Post

Artificial Intelligence

How to Implement Random Forest Regression in PySpark | by Yasmine Hejazi | Sep, 2023

by admin
September 26, 2023
Machine Learning

Mastering The Method of Choosing Your Most Accurate Machine Learning Algorithms: A Comprehensive Guide

by admin
September 26, 2023
Machine Learning

Mastering How to Calculate the Return on Equity: A Guide

by admin
September 26, 2023
Deep Learning

How Observability in DevOps is Transforming Dev Roles

by admin
September 26, 2023
Artificial Intelligence

Innovation for Inclusion: Hack.The.Bias with Amazon SageMaker

by admin
September 26, 2023
Edge AI

Flex Logix Expands Upon Industry-leading Embedded FPGA Customer Base

by admin
September 26, 2023

© Machine Learning News Hubb All rights reserved.

Use of these names, logos, and brands does not imply endorsement unless specified. By using this site, you agree to the Privacy Policy and Terms & Conditions.

Navigate Site

  • Home
  • Machine Learning
  • Artificial Intelligence
  • Big Data
  • Deep Learning
  • Edge AI
  • Neural Network
  • Contact Us

Newsletter Sign Up.

No Result
View All Result
  • Home
  • Machine Learning
  • Artificial Intelligence
  • Big Data
  • Deep Learning
  • Edge AI
  • Neural Network
  • Contact Us

© 2023 JNews - Premium WordPress news & magazine theme by Jegtheme.