The Future of Large Language Model Pre-training is Federated

Hackworth@lemmy.world · edited 26 days ago

Martineski@lemmy.dbzer0.com · 26 days ago

Is this how you make a sentient planet?

Petter1@lemm.ee · 24 days ago

I like that 🤩

Martineski@lemmy.dbzer0.com · 26 days ago

I wonder if this will become a big thing in the FOSS AI space. It’s hard to compete with corpos when it comes to computing power.

Audrey0nne@leminal.space · 26 days ago

A lot of words just to say that once the advertisers move in on a centralized platform, its value is shot. That’s a huge part of the reason I abandoned the last platform I was using and sought a federated alternative.

Hackworth@lemmy.world · 26 days ago

The papers have a ton of practical info about feasibility, implementation, etc.

General_Effort@lemmy.world · 24 days ago

As far as I know, federated learning is pretty much dead. The point would be that it allows organizations to create a joint model without sharing data. But organizations that don’t want to share data don’t seem to want to share a model either.

Hackworth@lemmy.world · 24 days ago

Until they can distribute the training load of large models to consumer graphics cards (and do something like SETI@home), it does seem like the benefit of distributed training isn’t enough to overcome the friction.