Hacker News

by borzunovon 7/19/23, 8:52 PMwith 3 comments

by borzunovon 7/19/23, 8:52 PM

by amrbon 7/19/23, 10:20 PM

Great project and I'm happy to see it expand to more models!

Petals runs Llama 2 (70B) from Colab at 5 tokens/sec