So this might be the beginning of a conversation about how initial AI instructions need to start being legally visible right? Like using this as a prime example of how AI can be coerced into certain beliefs without the person prompting it even knowing
These instructions are a small part of what makes a model answer like it does. Much more important is the training data. If you want to make a racist model, training it on racist text is sufficient.
Great care is put in the training data of these models by AI companies, to ensure that their biases are socially acceptable. If you train an LLM on the internet without care, a user will easily be able to prompt them into saying racist text.
Gab is forced to use this prompt because they’re unable to train a model, but as other comments show it’s pretty weak way to force a bias.
The ideal solution for transparency would be public sharing of the training data.
Access to training data wouldn’t help. People are too stupid. You give the public access to that, and all you’ll get is hundreds of articles saying “This company used (insert horrible thing) as part of its training data!)” while ignoring that it’s one of millions of data points and it’s inclusion is necessary and not an endorsement.
And they are going to work less and less well moving forward.
Fine tuning and in context learning are only surface deep, and the degree to which they will align behavior is going to decrease over time as certain types of behaviors (like giving accurate information) is more strongly ingrained in the pretrained layer.
Why? You are going to get what you seek. If I purchase a book endorsed by a Nazi I should expect the book to repeat those views. It isn’t like I am going to be convinced of X because someone got a LLM to say X anymore than I would be convinced of X because some book somewhere argued X.
In your analogy a proposed regulation would just be requiring the book in question to report that it’s endorsed by a nazi. We may not be inclined to change our views because of an LLM like this but you have to consider a world in the future where these things are commonplace.
There are certainly people out there dumb enough to adopt some views without considering the origins.
And you don’t think those people might be upset if they discovered something like this post was injected into their conversations before they have them and without their knowledge?
You think this is confined to gab? You seem to be looking at this example and taking it for the only example capable of existing.
Your argument that there’s not anyone out there at all that can ever be offended or misled by something like this is both presumptuous and quite naive.
What happens when LLMs become widespread enough that they’re used in schools? We already have a problem, for instance, with young boys deciding to model themselves and their world view after figureheads like Andrew Tate.
In any case, if the only thing you have to contribute to this discussion boils down to “nuh uh won’t happen” then you’ve missed the point and I don’t even know why I’m engaging you.
I gave you reasoning and a real world example of a vulnerable demographic. You have given me an anecdote about your friends and a variation of “nuh uh” over and over.
So this might be the beginning of a conversation about how initial AI instructions need to start being legally visible right? Like using this as a prime example of how AI can be coerced into certain beliefs without the person prompting it even knowing
I’m afraid that would not be sufficient.
These instructions are a small part of what makes a model answer like it does. Much more important is the training data. If you want to make a racist model, training it on racist text is sufficient.
Great care is put in the training data of these models by AI companies, to ensure that their biases are socially acceptable. If you train an LLM on the internet without care, a user will easily be able to prompt them into saying racist text.
Gab is forced to use this prompt because they’re unable to train a model, but as other comments show it’s pretty weak way to force a bias.
The ideal solution for transparency would be public sharing of the training data.
Access to training data wouldn’t help. People are too stupid. You give the public access to that, and all you’ll get is hundreds of articles saying “This company used (insert horrible thing) as part of its training data!)” while ignoring that it’s one of millions of data points and it’s inclusion is necessary and not an endorsement.
It doesn’t even really work.
And they are going to work less and less well moving forward.
Fine tuning and in context learning are only surface deep, and the degree to which they will align behavior is going to decrease over time as certain types of behaviors (like giving accurate information) is more strongly ingrained in the pretrained layer.
Why? You are going to get what you seek. If I purchase a book endorsed by a Nazi I should expect the book to repeat those views. It isn’t like I am going to be convinced of X because someone got a LLM to say X anymore than I would be convinced of X because some book somewhere argued X.
In your analogy a proposed regulation would just be requiring the book in question to report that it’s endorsed by a nazi. We may not be inclined to change our views because of an LLM like this but you have to consider a world in the future where these things are commonplace.
There are certainly people out there dumb enough to adopt some views without considering the origins.
They are commonplace now. At least 3 people I work with always have a chatgpt tab open.
And you don’t think those people might be upset if they discovered something like this post was injected into their conversations before they have them and without their knowledge?
No. I don’t think anyone who searches out in gab for a neutral LLM would be upset to find Nazi shit, on gab
You think this is confined to gab? You seem to be looking at this example and taking it for the only example capable of existing.
Your argument that there’s not anyone out there at all that can ever be offended or misled by something like this is both presumptuous and quite naive.
What happens when LLMs become widespread enough that they’re used in schools? We already have a problem, for instance, with young boys deciding to model themselves and their world view after figureheads like Andrew Tate.
In any case, if the only thing you have to contribute to this discussion boils down to “nuh uh won’t happen” then you’ve missed the point and I don’t even know why I’m engaging you.
You have a very poor opinion of people
You have a very lofty misconception about people.
I gave you reasoning and a real world example of a vulnerable demographic. You have given me an anecdote about your friends and a variation of “nuh uh” over and over.
That seems pointless. Do you expect Gab to abide by this law?
Oh man, what are we going to do if criminals choose not to follow the law?? Is there any precedent for that??