Mistral 7B v0.2 has embedded ethical guidelines

sapient [they/them] · edit-2 1 year ago

Mistral 7B v0.2 has embedded ethical guidelines

exu@feditown.com · 1 year ago

Has anyone objectively compared the v0.1 and v0.2 instruct models yet? I did seem to get slightly better output with the v0.2, but I just started playing with llms recently.

martinb@lemmy.sdf.org · 1 year ago

Thanks. As a new ollama user, this is very helpful

rufus@discuss.tchncs.de · edit-2 1 year ago

Don’t wait for Mistral AI to publish information on their models. I think they always just drop them and maybe follow up with benchmarks. Something we could calculate ourselves. But not useful information.

Have you tried “jailbreaking” it? I’d think that could give some more insight. For example how deep the safety precautions are embedded. And what kind. Does it just roleplay the helpful assistant and can be nudged into other roles easily, or is it tuned to make it next to impossible to circumvent this?

sapient [they/them] · 11 months ago

I haven’t really messed around trying to jailbreak the new weights. I switched back to the old ones pretty quick ^.

I am running this stuff on a pre 2015 cpu, so I tend to get about 2 tokens/sec output so experimentation can be slow, and i have somewhat limited space on my SSD (cus its fairly full). So I tend to delete and redownload models and, well, they’re fairly large and its annoying :p

Experimenting is doable but i’ll leave it to someone else for now ^., got other things to do. But if anyone else wants to i’d encourage them to reply to this post with more details on the embedded safeties, I certaiy would be interested.

rufus@discuss.tchncs.de · edit-2 11 months ago

Well, we’re kind of in the similar boat. I have a PC and a laptop with Skylake CPUs in them. I don’t know when I bought them, that generation is from 2015 so must be around 2016.

I bought 32GB of additional RAM for the PC since RAM has become quite cheap. That allows me to keep KoboldCpp loaded all the time and I can store the models on a slow spinning 6TB harddisk.

I think I get like 4 tokens per second. And I’m fine with that. KoboldCpp’s “ContextShift” feature has helped me generate longer texts in a chatbot-scenario since now I don’t have to re-process all of the input text that often.

But you’re right. Experimentation is kinda slow on machines like that. I don’t think I want to buy a GPU and also a new PC that matches that. I thought a moment about buying an old, used NVidia P40 for about 200€ but I don’t think it’s worth the hassle. I sometimes do experimentation, but I just rent a cloud GPU on runpod.io for like $1 per hour.