Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models
v1v2v3v4 (latest)

Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models

    MoE

Papers citing "Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models"