Discussion about this post

User's avatar
Little Kenny's avatar

At a first glance, it seems that a challenge to extending the concept of an anatomical compiler to an AI alignment compiler is the heterogeneity of AI models. A specific instance of an anatomical compiler, I presume, would be purpose-built for a specific type of cellular network in a specific type of organism, consisting of a specific set of cell types and signaling pathways exchanging specific types of messages. Whereas there is a whole world of AI models beyond the headlining LLMs coming into existence with an endlessly wide variety of meanings symbolically encoded in the information being exchanged with other humans and AIs in each of their respective environments.

(Although to be complete, I could also mention here that the electricity supply and other elements of keeping the hardware running, and/or the economic tokens some of them are given with which to subcontract other digital services, could be implemented into AI models as meaningful signals, and this realm would be more homogeneous across them all.)

Are there any thoughts on this?

Little Kenny's avatar

Has there been any thought given to whether an AI alignment compiler would be put into use while a model is being trained, or post-training (i.e., when it’s deployed), or both?

2 more comments...

No posts

Ready for more?