Petals runs Llama 2 (70B) from Colab at 5 tokens/sec

by borzunov on 7/19/23, 8:52 PM with 3 comments
by amrb on 7/19/23, 10:20 PM

Great project and I'm happy to see it expand to more models!