Submission statement: what will be the ramifications of not being able to tell if AIs are lying?
If AIs appear to stop lying on tests, have they stopped or have they just gotten better at lying?
If we figure out interpretability, maybe we’ll be able to read the AIs’ minds and be able to tell if they’re lying.
MacDugin on
I didn’t see him being quoted as saying “lying” all AIs hallucinate so it isn’t perfect. I think the tittle is just a smear.
Susanna_NCPU on
They can’t even stop themselves from lying so this isn’t some revelation
johnnytruant77 on
Could it be that starting with a behavior you want to emulate and then engineering a solution that approximates that behavior is not the best way to model human cognition
RorschachAttack on
“Google’s AI-powered search feature confidently told one user to put glue on their pizza, referencing an 11-year-old joke on Reddit.”
Lol
baelrog on
Every top AI researchers in the world: We can’t keep AI from hallucinating.
Talented engineers at Apple tell Tim Cook: We can’t keep AI from hallucinating.
Tim Cook: We can’t keep AI from hallucinating.
Click bait news title: Tim Cook admits Apple may never be able to make its AI stop lying.
6 Comments
Submission statement: what will be the ramifications of not being able to tell if AIs are lying?
If AIs appear to stop lying on tests, have they stopped or have they just gotten better at lying?
If we figure out interpretability, maybe we’ll be able to read the AIs’ minds and be able to tell if they’re lying.
I didn’t see him being quoted as saying “lying” all AIs hallucinate so it isn’t perfect. I think the tittle is just a smear.
They can’t even stop themselves from lying so this isn’t some revelation
Could it be that starting with a behavior you want to emulate and then engineering a solution that approximates that behavior is not the best way to model human cognition
“Google’s AI-powered search feature confidently told one user to put glue on their pizza, referencing an 11-year-old joke on Reddit.”
Lol
Every top AI researchers in the world: We can’t keep AI from hallucinating.
Talented engineers at Apple tell Tim Cook: We can’t keep AI from hallucinating.
Tim Cook: We can’t keep AI from hallucinating.
Click bait news title: Tim Cook admits Apple may never be able to make its AI stop lying.