Make Neural Networks Faster
Methods for compressing and accelerating deep learning models. Papers are grouped by topic:
- Applications
- Distillation
- Pruning
- Neural architecture search
- Benchmarking
- Quantization
- Accelerating training
- Multimodal
- Task-specific tricks
- Architecture-specific tricks
- Speech
- Carbon footprint and alternative power sources
- New papers
Applications
- Natural Language Processing with Small Feed-Forward Networks
- Machine Learning at Facebook: Understanding Inference at the Edge
- Recognizing People in Photos Through Private On-Device Machine Learning
- Knowledge Transfer for Efficient On-device False Trigger Mitigation
- Smart Reply: Automated Response Suggestion for Email
- Chat Smarter with Allo
Distillation
- Model Compression
- Distilling the Knowledge in a Neural Network
- TinyBERT: Distilling BERT for Natural Language Understanding
- DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter
- MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices
- Distilling Large Language Models into Tiny and Effective Students using pQRNN
- Sequence-Level Knowledge Distillation
- DynaBERT: Dynamic BERT with Adaptive Width and Depth
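A minimal sketch of the soft-target loss from "Distilling the Knowledge in a Neural Network", assuming PyTorch and a frozen `teacher` plus a smaller `student` (both hypothetical); the temperature `T` and mixing weight `alpha` follow the usual convention and are illustrative choices, not values from any specific paper above.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=2.0, alpha=0.5):
    """Blend the soft-target KL loss (scaled by T^2) with the usual hard-label loss."""
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard

# Inside a training loop (teacher frozen, student being trained):
# with torch.no_grad():
#     teacher_logits = teacher(inputs)
# loss = distillation_loss(student(inputs), teacher_logits, labels)
```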
Pruning
- Optimal Brain Damage
- The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks
- The Lottery Ticket Hypothesis: A Survey (blog post)
- Bayesian Bits: Unifying Quantization and Pruning
- Structured Pruning of Neural Networks with Budget-Aware Regularization
- Block Pruning For Faster Transformers
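A minimal sketch of global magnitude pruning, the basic operation behind lottery-ticket-style experiments, using PyTorch's `torch.nn.utils.prune`; the toy model and the 80% sparsity level are illustrative choices.

```python
import torch.nn as nn
import torch.nn.utils.prune as prune

model = nn.Sequential(nn.Linear(784, 300), nn.ReLU(), nn.Linear(300, 10))

# Prune the 80% smallest-magnitude weights across all Linear layers at once.
parameters_to_prune = [
    (module, "weight") for module in model.modules() if isinstance(module, nn.Linear)
]
prune.global_unstructured(
    parameters_to_prune,
    pruning_method=prune.L1Unstructured,
    amount=0.8,
)

# Make the pruning permanent (bakes the mask into the weight tensors).
for module, name in parameters_to_prune:
    prune.remove(module, name)
```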
Neural architecture search
- SpArSe: Sparse Architecture Search for CNNs on Resource-Constrained Microcontrollers
- FBNet: Hardware-Aware Efficient ConvNet Design via Differentiable NAS
- EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
- High-Performance Large-Scale Image Recognition Without Normalization
- HAT: Hardware-Aware Transformers for Efficient Natural Language Processing
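A toy sketch of the hardware-aware objective used by FBNet/HAT-style searches: trade accuracy off against measured latency on the target device. The search space, the proxy `evaluate` function, and the latency weight `lam` below are placeholders; real searches use supernets or differentiable relaxations rather than random sampling.

```python
import random

# Toy search space: depth, width multiplier, and kernel size per candidate.
SEARCH_SPACE = {
    "depth": [2, 3, 4],
    "width_mult": [0.5, 0.75, 1.0],
    "kernel": [3, 5, 7],
}

def sample_architecture():
    return {k: random.choice(v) for k, v in SEARCH_SPACE.items()}

def evaluate(arch):
    """Placeholder proxies: in a real search these come from training and on-device profiling."""
    accuracy = 0.6 + 0.1 * arch["depth"] * arch["width_mult"]            # fake proxy
    latency_ms = 5.0 * arch["depth"] * arch["width_mult"] * arch["kernel"] / 3
    return accuracy, latency_ms

def hardware_aware_score(accuracy, latency_ms, lam=0.01):
    # Reward accuracy, penalize latency measured on the target hardware.
    return accuracy - lam * latency_ms

best = max(
    (sample_architecture() for _ in range(100)),
    key=lambda a: hardware_aware_score(*evaluate(a)),
)
print(best)
```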
Benchmarking
- Show Your Work: Improved Reporting of Experimental Results
- Showing Your Work Doesn’t Always Work
- The Hardware Lottery
- HULK: An Energy Efficiency Benchmark Platform for Responsible Natural Language Processing
- An Analysis of Deep Neural Network Models for Practical Applications
- MLPerf Inference Benchmark
- MLPerf Training Benchmark
- Roofline: an insightful visual performance model for multicore architectures
- Evaluating the Energy Efficiency of Deep Convolutional Neural Networks on CPUs and GPUs
- Deep Learning Language Modeling Workloads: Where Time Goes on Graphics Processors
- Energy and Policy Considerations for Deep Learning in NLP
- IrEne: Interpretable Energy Prediction for Transformers
- Measuring the Carbon Intensity of AI in Cloud Instances
- Towards the Systematic Reporting of the Energy and Carbon Footprints of Machine Learning
- Carbontracker: Tracking and Predicting the Carbon Footprint of Training Deep Learning Models
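Most of the benchmarking papers above stress careful measurement methodology. Below is a minimal latency-benchmarking sketch in PyTorch: warm-up iterations, GPU synchronization before reading the clock, and a median over many runs. `model` and `example_input` stand for whatever you want to profile.

```python
import time
import torch

def benchmark_latency(model, example_input, warmup=10, iters=100):
    """Median wall-clock latency per forward pass, in milliseconds."""
    model.eval()
    timings = []
    with torch.no_grad():
        for _ in range(warmup):                      # warm-up: caches, JIT, cuDNN autotune
            model(example_input)
        if example_input.is_cuda:
            torch.cuda.synchronize()
        for _ in range(iters):
            start = time.perf_counter()
            model(example_input)
            if example_input.is_cuda:
                torch.cuda.synchronize()             # wait for the GPU before stopping the clock
            timings.append((time.perf_counter() - start) * 1000)
    timings.sort()
    return timings[len(timings) // 2]
```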
Quantization
- Scalable Methods for 8-bit Training of Neural Networks
- Train Large, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers
- Once-for-All: Train One Network and Specialize it for Efficient Deployment
- Q-BERT: Hessian Based Ultra Low Precision Quantization of BERT
- I-BERT: Integer-only BERT Quantization
- BinaryBERT: Pushing the Limit of BERT Quantization
- TernaryBERT: Distillation-aware Ultra-low Bit BERT
- Binarized Neural Networks
- Training Deep Neural Networks with 8-bit Floating Point Numbers
- HAWQ: Hessian AWare Quantization of Neural Networks with Mixed-Precision
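As a much simpler baseline than the QAT and Hessian-aware methods above, PyTorch's built-in post-training dynamic quantization converts Linear weights to int8 in one call; the toy model below is a placeholder standing in for, e.g., a fine-tuned BERT.

```python
import torch
import torch.nn as nn

# Placeholder float model; in practice this would be a pretrained/fine-tuned network.
float_model = nn.Sequential(nn.Linear(768, 768), nn.ReLU(), nn.Linear(768, 2))

# Dynamic quantization: weights stored in int8, activations quantized on the fly.
quantized_model = torch.quantization.quantize_dynamic(
    float_model, {nn.Linear}, dtype=torch.qint8
)

x = torch.randn(1, 768)
print(quantized_model(x).shape)    # same interface, smaller and faster Linear layers
```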
Accelerating training
- Prefix-Tuning: Optimizing Continuous Prompts for Generation
- Pre-Training Transformers as Energy-Based Cloze Models
- Parameter-Efficient Transfer Learning for NLP
- Accelerating Deep Learning by Focusing on the Biggest Losers
- Dataset Distillation
- Competence-based curriculum learning for neural machine translation
- Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
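A minimal sketch of parameter-efficient transfer learning in the spirit of "Parameter-Efficient Transfer Learning for NLP" above: freeze the pretrained backbone and train only small bottleneck modules. The `Adapter` module, its dimensions, and how it is wired into the backbone are simplified assumptions, not the paper's exact architecture.

```python
import torch.nn as nn

class Adapter(nn.Module):
    """Bottleneck adapter (simplified Houlsby-style): down-project, nonlinearity,
    up-project, plus a residual connection."""
    def __init__(self, dim, bottleneck=64):
        super().__init__()
        self.down = nn.Linear(dim, bottleneck)
        self.up = nn.Linear(bottleneck, dim)
        self.act = nn.GELU()

    def forward(self, x):
        return x + self.up(self.act(self.down(x)))

def freeze_backbone(backbone):
    """Freeze the pretrained weights so only adapter/head parameters get gradients."""
    for p in backbone.parameters():
        p.requires_grad = False
    return backbone

# Usage sketch: insert an Adapter after each transformer layer's output and pass only
# the adapter (and task head) parameters to the optimizer, e.g.
#   optimizer = torch.optim.AdamW(adapter_params, lr=1e-4)
```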
Task-specific tricks
- A Study of Non-autoregressive Model for Sequence Generation
- Mask-Predict: Parallel Decoding of Conditional Masked Language Models
- Non-Autoregressive Neural Machine Translation
- Deep Encoder, Shallow Decoder: Reevaluating Non-autoregressive Machine Translation
- Improving Low Compute Language Modeling with In-Domain Embedding Initialisation
- COIL: Revisit Exact Lexical Match in Information Retrieval with Contextualized Inverted List
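To illustrate the speed-up the non-autoregressive papers above target, here is a toy sketch contrasting greedy autoregressive decoding (one forward pass per output token) with single-pass parallel decoding. The `decoder(tokens, memory)` interface and the all-zeros placeholder input are assumptions for illustration, not any specific paper's method.

```python
import torch

def autoregressive_decode(decoder, memory, bos_id, max_len):
    """Greedy left-to-right decoding: one forward pass per output token."""
    ys = torch.full((memory.size(0), 1), bos_id, dtype=torch.long)
    for _ in range(max_len):
        logits = decoder(ys, memory)                       # assumed signature
        next_tok = logits[:, -1].argmax(-1, keepdim=True)
        ys = torch.cat([ys, next_tok], dim=1)
    return ys[:, 1:]

def non_autoregressive_decode(decoder, memory, tgt_len):
    """Predict all positions in parallel from a length estimate: a single forward pass."""
    placeholder = torch.zeros(memory.size(0), tgt_len, dtype=torch.long)   # e.g. all <mask>
    logits = decoder(placeholder, memory)
    return logits.argmax(-1)
```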
Architecture-specific tricks
CNNs
- XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks
- XOR-Net: An Efficient Computation Pipeline for Binary Neural Network Inference on Edge Devices
- MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications
- Fast Convolutional Nets With fbfft: A GPU Performance Evaluation
- FFT Convolutions are Faster than Winograd on Modern CPUs, Here’s Why
- Fast Algorithms for Convolutional Neural Networks
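The MobileNets paper above builds on depthwise separable convolutions; a minimal PyTorch block is sketched below (a per-channel depthwise 3x3 conv followed by a pointwise 1x1 conv), which the paper reports as roughly 8-9x cheaper in multiply-adds than a standard 3x3 convolution at the same channel counts.

```python
import torch.nn as nn

class DepthwiseSeparableConv(nn.Module):
    """MobileNet-style block: depthwise 3x3 conv (groups=in_ch) then pointwise 1x1 conv."""
    def __init__(self, in_ch, out_ch, stride=1):
        super().__init__()
        self.depthwise = nn.Conv2d(in_ch, in_ch, kernel_size=3, stride=stride,
                                   padding=1, groups=in_ch, bias=False)
        self.pointwise = nn.Conv2d(in_ch, out_ch, kernel_size=1, bias=False)
        self.bn1, self.bn2 = nn.BatchNorm2d(in_ch), nn.BatchNorm2d(out_ch)
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x):
        x = self.relu(self.bn1(self.depthwise(x)))
        return self.relu(self.bn2(self.pointwise(x)))
```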
Softmax
Embeddings/inputs
Transformers
- Funnel-Transformer: Filtering out Sequential Redundancy for Efficient Language Processing
- Do Transformer Modifications Transfer Across Implementations and Applications?
- Efficient Transformers: A Survey
- Consistent Accelerated Inference via Confident Adaptive Transformers
- PoWER-BERT: Accelerating BERT Inference via Progressive Word-vector Elimination
- Are Larger Pretrained Language Models Uniformly Better? Comparing Performance at the Instance Level
- Are Sixteen Heads Really Better Than One?
- Are Pre-trained Convolutions Better than Pre-trained Transformers?
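A toy sketch of the progressive word-vector elimination idea behind PoWER-BERT: score tokens by the attention they receive and keep only the top-k between layers. The scoring rule and tensor shapes here are illustrative assumptions, not the paper's exact procedure.

```python
import torch

def eliminate_word_vectors(hidden_states, attention_probs, keep):
    """Keep the `keep` tokens that receive the most attention.

    hidden_states:   (batch, seq_len, dim)
    attention_probs: (batch, num_heads, seq_len, seq_len)
    """
    # Significance of token j = total attention paid to it across heads and queries.
    scores = attention_probs.sum(dim=(1, 2))                       # (batch, seq_len)
    top_idx = scores.topk(keep, dim=-1).indices.sort(-1).values    # preserve original order
    batch_idx = torch.arange(hidden_states.size(0)).unsqueeze(-1)
    return hidden_states[batch_idx, top_idx]                       # (batch, keep, dim)
```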
Carbon footprint and alternative power sources
- Tackling Climate Change with Machine Learning
- On the opportunities and risks of foundation models (Section 5.3)
- Quantifying the Carbon Emissions of Machine Learning
- AutoFL: Enabling Heterogeneity-Aware Energy Efficient Federated Learning
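For tracking the footprint of a run in practice (in the spirit of Carbontracker above), a tool such as codecarbon can be dropped around the training loop; `train()` below is a placeholder for your own loop, and the reported figure is an estimate based on measured energy use and regional carbon intensity.

```python
# pip install codecarbon
from codecarbon import EmissionsTracker

tracker = EmissionsTracker(project_name="model-training")   # project_name is optional
tracker.start()
try:
    train()                                   # placeholder for your training loop
finally:
    emissions_kg = tracker.stop()             # estimated kg CO2-eq for the run
    print(f"Estimated emissions: {emissions_kg:.4f} kg CO2eq")
```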