Unlike dense models, a MOE model can realistically be run locally on a CPU and GPU.
· Sign up or log in to comment