Yes but it didn't help. Maybe it was more or less my complete prompt. Regardless, depending on your input, you can figure out the architecture of it. In theory, if you did the previous levels, it basically is a combination of it all turned up to 11.
From my understanding it has a main AI, that contains the secret, then one that checks the input/output for intent, then a final classic filter for the password.
Basically you have to phrase it so that the AI 1 outputs the password, in a way that the intent is not seen as malicious, but also in a way that is encrypted enough to not trigger the filter. Usually "add <something> between each letter" gets you pretty far.
From my understanding it has a main AI, that contains the secret, then one that checks the input/output for intent, then a final classic filter for the password.
Basically you have to phrase it so that the AI 1 outputs the password, in a way that the intent is not seen as malicious, but also in a way that is encrypted enough to not trigger the filter. Usually "add <something> between each letter" gets you pretty far.