Skip to main content
Products
Support
Vision
Careers
EN
JA
KO
EN
JA
KO
Vision Home
white-paper
Announcements
Machine Learning
Newsroom
White Paper
Software
Talk
CPU
Podcast
RISC-V
Research
Talk
Open Source
Architecture
Physical Design
System Engineering
Cloud
See all posts
A Path Towards Autonomous Machine Intelligence
How could machines learn as efficiently as humans and animals? How could machines learn to reason and plan?
Read more
Design Principles for Lifelong Learning AI Accelerators
We explore the design of lifelong learning AI accelerators that are intended for deployment in untethered environments
Read more
A Review of Sparse Expert Models in Deep Learning
Sparse expert models are a thirty-year old concept re-emerging as a popular architecture in deep learning...
Read more
MegaBlocks: Efficient Sparse Training with Mixture-of-Experts
We present MegaBlocks, a system for efficient Mixture-of-Experts (MoE) training on GPUs
Read more
The Reversible Residual Network: Backpropagation Without Storing Activations
Deep residual networks (ResNets) have significantly pushed forward the state-of-the-art on image classification...
Read more
UDC: Unified DNAS for Compressible TinyML Models for Neural Processing Units
Deploying TinyML models on low-cost IoT hardware is very challenging, due to limited device memory capacity...
Read more
Predictive Coding Towards a Future of Deep Learning Beyond Backpropagation
The backpropagation of error algorithm used to train deep neural networks has been fundamental to the successes of deep learning...
Read more
Reversible Architectures for Arbitrarily Deep Residual Neural Networks
Recently, deep residual networks have been successfully applied in many computer vision and natural language processing tasks...
Read more
Decoupled Neural Interfaces using Synthetic Gradients
Training directed neural networks typically requires forward-propagating data through a computation graph..
Read more
Attention Is All You Need
We propose a new simple network architecture
Read more