sigmoid.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
A social space for people researching, working with, or just interested in AI!

Server stats:

596
active users

Osma Suominen

@hisham
If there are any "fragments of your work" left in the output, they are very few and far between.

Take the recent Llama 3 8B model from Meta. It was trained on 15T tokens, around 100 terabytes (10^14 bytes) of text, including some written by you and me. The trained model can be downloaded as a set of files totalling around 16GB (1.6 * 10^10 bytes). There's no way all that text can be compressed by four magnitudes while retaining the original works within.

@mcpinson @mcc @WomanCorn