The thing I'm most interested in, this morning, is the leaks suggesting that GPT-4 is a "faster" version of ChatGPT. Could be nonsense. But if not ... making these things lighter weight (or able to handle longer contexts at the same weight) would really change the business case and (also, selfishly) matter a lot for academics.
E.g., imagine we had a model that could do reliable summarization of 10,000-word texts and also run on one GPU. A year later the text-analysis part of @dh would look completely different. Everyone would be using that model.
This also explains why there has been no rush to publish the GPT-4 paper. An abstractly interesting architecture? Great, publish. But improvements that make LLMs more *deployable* ... you probably wouldn't publish those until you were ready to deploy.
@TedUnderwood There's been so much speculation... Are there any real signs that the delay isn't something more mundane, like the development process being a shitshow?
@micahcorah Well, this would have to be faked. It could be faked! But I would say this is more than speculation: it's either a leak, or a fraud. https://medium.com/@owenyin/scoop-oh-the-things-youll-do-with-bing-s-chatgpt-62b42d8d7198