Discussion about this post

YI PENG

If anyone remembers how Qwen was born 3 years ago, the debut model itself was previewed and announced as a personalized LLM. For half a year, I was responsible for its character at the 14B size, which was also the suggested model for games, education, and smart devices. I even invited one of the most popular stand-up comedians/professors to contribute to the post-training/alignment corpus. However, at that time, people didn't realize RL could scale up the way it does today.

Kai Williams

> You may not have the exact methodology, but you have the outcomes of those methodologies.

It's also worth noting that Anthropic's Evan Hubinger was the last author on the open-source replication of character training! Not quite the same as a constitution, but not that different either.

(Incidentally, Nathan Lambert was the third author)

https://arxiv.org/pdf/2511.01689
