Codesota · RL Environmentshumor preference alignment← All environments
§ private

LOL Arena.

An environment for humor preference alignment. Not currently scorable for discriminative power.

not yet launched

§ Work with us

Need one that still separates models?

When the public environment for your capability saturates, you can’t tell your models apart and you can’t train past it. We build private, contamination-resistant, verifiable-reward environments and evals on a hold-out set — designed to discriminate where the public ones no longer do.