Aligning an H-JEPA agent via training on the outputs of an LLM-based "exemplary actor"
engineeringideas.substack.com
This post includes the overview and the conclusion of the article that was posted on LessWrong. Overview In section 2, I describe the “exemplary actor”, an LMCA (language model cognitive architecture) that takes a simple, “brute force” approach to alignment
Aligning an H-JEPA agent via training on the outputs of an LLM-based "exemplary actor"
Aligning an H-JEPA agent via training on the…
Aligning an H-JEPA agent via training on the outputs of an LLM-based "exemplary actor"
This post includes the overview and the conclusion of the article that was posted on LessWrong. Overview In section 2, I describe the “exemplary actor”, an LMCA (language model cognitive architecture) that takes a simple, “brute force” approach to alignment