1 |
Beyond Distillation: Task-level Mixture-of-Experts for Efficient Inference ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Beyond Distillation: Task-level Mixture-of-Experts for Efficient Inference ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Measurement of $W^{\pm}$-boson and $Z$-boson production cross-sections in $pp$ collisions at $\sqrt{s}=2.76$ TeV with the ATLAS detector
|
|
|
|
BASE
|
|
Show details
|
|
|
|