Yeah thanks. I said pseudo because I do t really know the meaning of these words apart from vic3, nor do I know of anyone from the americas :D
Yeah thanks. I said pseudo because I do t really know the meaning of these words apart from vic3, nor do I know of anyone from the americas :D
If I want to come off as a pseudo-intellectual I call them Yankee for east-north and Dixie for south-west (but also Florida and the bible belt) and gringo for hispanic Americans. I don’t know if any of those terms are really correct to use in that context and my definitions are entirely vibes-based.
Lmao there are so many ppl on Lemmy who unironically think like this. Let culture grow wild and fuck the profits.
But what is meant by “integrity of the model, inputs and outputs”?
I guess I don’t understand the attack vector, what’s the threat here? Someone messes with the model file or refines a model towards a specific malicious bias like inserting scam links where legit links would go and passes it off as the real deal?
I’m more general cybersec than crypto so idk but isn’t that what hash sums are for?
Surely if someone messed with my .ckpt or .safetensors it won’t be the same file anymore?
And what does that have to do with validity of the inputs?
I don’t understand what is exactly being verified there? Model integrity? Factors for “reasoning”?
Coke isn’t much better
https://www.youtube.com/watch?v=5HFZ3cH1UAI
Gosh I’m jelly though, I’m in the UK and tap water used to be drinkable, but now it’s more hydrocarbon than hydrogen dioxide these days. Joy of privatised utilities eh?
To be honest the one
Where V gets cured of the Relic by NUSA, especially if you just ignore V’s stupidity in trying to go back to Night City for some reason even though if you’re a nomad she would’ve only been there for like a few months total and actually imagine her being a semi-developed 23 year old adult who’d make the rational choice of just staying in DC in a cushy job away from the madness.
is pretty good.
Meh, even so, do you really wanna be Saburo? Watching your back constantly, watching your kids vie for your seat waiting for you to croak, seeking eternal life through different bodies as some microchip construct knowing basically anyone would smash that shit to bits if they felt they could get away with it to gain power.
It doesn’t sound that great to me either because I’m not sure even a normal, finite life in such a world isn’t too much already.
If I had to be someone I’d prefer to be a nomad living out in the desert with my fam, somewhat isolated, rely on each other, far away from corporations and their wars.
Or maybe I’d be a celebrity living in North Oak like Kerry, but even then, there’s a good chance some cyberpsycho guns you down while you’re out on the town in your snazzy new Caliburn to buy a latte, at last some brief relief from that one fucking ad with the dude screaming.
We’re mocking Musk, dummy
Idk I feel like if I want to visit a hustle or die dystopia I just go outside
It’s complicated.
I know Stable Diffusion best so I’ll speak to that, they used to the LAION-5B dataset, which is, in practice freely available to download and use:
https://www.kaggle.com/code/vitaliykinakh/guie-laion-5b-collect-and-download
https://github.com/opendatalab/laion5b-downloader
It’s also on HuggingFace but it’s unavailable.
https://huggingface.co/datasets/danielz01/laion-5b
But you can use this smaller newer version:
https://huggingface.co/datasets/laion/relaion2B-en-research
Whether it’s appropriately licensed is an unsolved question though.
The dataset itself and the text portion of the text-imags pairs needed for training is CC-BY-SA, the newer versions linked above are CC-BY-4.0. https://creativecommons.org/licenses/by/4.0/deed.en
The images however are technically under their own copyright, which in practice means each of the billions of images could or could not have a licence that implicitly or explicitly forbids AI training use or forbids it only for commercial use.
Whether such a license is legally binding is at present unknown though, since licenses primarily deal with reproductions, which the pro-AI folks argue isn’t the case, and that training of NNs is more akin to viewing an image and memorising the patterns and relationships within, like a person viewing it.
That would make it non-infringing and therefore the model itself libre. In that case Mistral and LLaMa are also libre as long as the model itself is open source, which in this case really means “open weights”, so not like GPT and anything by “”“OpenAI”“”.
Weights are the result of a model being trained essentially. They’re they key bit that makes it or breaks it and how it works. Given that and knowing the structure of the model and framework used you can refine, modify and distribute it.
Those against AI will say that it’s more akin to file compression and that in one form or another it’s misuse. That would make the model an infringing derivative work and as such nor libre even if the model weights are open source.
In a way though you could argue that me vaguely memorising the imagery of a dude dressed in white holding a laser sword is just a lossy compressed copy of the copyrighted work of Star wars, and it’d be absurd to think that’s a violation and that infringement only occurs if I reproduce a work of substantial similarity commercially from that memory.
If I use Krita and draw a beautiful landscape which has been informed and inspired by at least in part by a movie I saw, is that copyright infringement or not? What if I use AI?
Well, current laws don’t say. We measure infringement in substantial similarity, provenance of information only comes in later (e.g. to prove against accidental similarity).
That’s also my own personal stance on the legal side of things, so up to you how you see it.
Mistral? Deepseek?
Not LLM but also SD which uses a very popular free dataset.
Lolwut? Public good is self-entitlement? Go read a fucking book. Communists are not pro-copyright, especially not when it only benefits the giant corpos.
Another day, another entitled artoid larping as progressive blocked.
As a socialist I believe intellectual property is a falsehood and technological advancement should be for the public good. Open source LLMs are for the public good.
Given the options between having open source LLMs and the US Govt banning non-corpo non-proprietary LLMs and giving a free pass to people like Musk and Altman and Zucc to monopolize, I happily pick the former.
You’re delusional if you think they will pay anyone, the only way zucc will pay is with a guillotine.
Corpos will make inter-platform deals that’ll simply make all online data licensable for the right price and enrich each other so you can’t avoid it while still actually being a career creative, but price out academic researchers and the public sector so that all fruits of it stay behind closed R&D doors and be free of ethics etc.
Continuing in your role as a useful idiot, you’ll also most likely also foot the bill for it via subsidies from your taxes to “develop the AI sector” in some anti-China dick measuring contest by the US.
You will then be sold this data back via proprietary chat bots via a monthly subscription and you better pay up because once it gets really good, it will become mandatory to use for just about any job, leaving you with no choice.
Or you can support FOSS LLMs.
Jesus Christ this is Windows-tier insane computing behaviour from Ubuntu. Fuck Ubuntu.
Removed by mod
Unsubscribe from politics communities temporarily? Or do you browse by All or something?
Which ones? What for?
Really the only service of such extreme convenience I can’t help but use it is cloudflare tunnels for quickly selfhosting and their cheap asf domain registrar.
Still, I don’t host anything via cloudflare per se even as a noobie.
Not sure what else you need.
Last time I had this was with Tyler’s IGOR vinyl I got in 2019 or so that had the extra track “Boyfriend” that wasn’t on Spotify, but I think it has since been released on tidal or some other obscure music service by niw.
Quickly became my fav track off the album too.