KPMG pulls report on AI usage due to apparent hallucinations
Posted by Brajeshwar 3 days ago
Comments
Comment by simonw 2 days ago
I guess nobody ever got fired for paying KPMG and friends for an expensive report that supported their priors.
Comment by a_bonobo 2 days ago
Comment by petre 1 day ago
Comment by tjwebbnorfolk 2 days ago
The purpose of paying for these reports is for executives to have someone else to blame when their idea doesn't work. It has nothing to do with the correctness of the content.
Comment by overfeed 2 days ago
That's accurate, for the first draft. Similar to big legal firms - subsequent versions are signed-off and passed up (and if revisions request, down) the hierarchy, each stratum with its own billing rate(s).
Which makes me wonder when the hallucinations got added.
Comment by tjwebbnorfolk 2 days ago
Comment by overfeed 2 days ago
It can't have been at any of the big 4, because partners aren't skipping 4+ org-chart layers to look at draft documents written by early-career associates. I have no experience with body shops - if that's where you were.
Comment by NoPicklez 2 days ago
The purpose of most reports are absolutely for Assurance to decision makers or management and often times, we disagree with management or provide a view that might not favorable. Which just reflects the realities of what we have identified or tested.
As I said, this seems like thought leadership dribble which absolutely even as someone who has worked in Big 4, I think they're pretty average.
Comment by esseph 2 days ago
Comment by SpicyLemonZest 2 days ago
Comment by yieldcrv 2 days ago
I mean basically. KMPG is a regulatory checkmark in some industries
Comment by gdulli 2 days ago
Well they were true to their word about demonstrating a new and increasingly relevant definition of "excellence."
Comment by adham541 2 days ago
Comment by ChrisArchitect 2 days ago
Comment by jruohonen 3 days ago
Comment by Scoundreller 2 days ago
Comment by bigfatkitten 2 days ago
Comment by XenophileJKO 2 days ago
I use this regularly for my personal financial research system. Even flagship models make mistakes. Though currently the issue is usually the model using a figure from and older report. Cross-check reduces that dramatically.
Comment by watwut 2 days ago
Comment by modzu 2 days ago
Comment by figassis 2 days ago
Then have another set of agents, with skills like web browsing (to verify that links actually exist, maybe that references and abstracts actually match, etc), have one engineer (or agent) write a small script to help with this (just make sure you test it, and a bit).
So your work is not verified until your references table is 90% green checkmarks, maybe with uncertainty figures.
A human can then verify the ones with under 90% certainty.
This alone gets you a long way there. Does not costs the millions they're being paid.
It's quite interesting that these companies marketed themselves as them best of the best in excellence, accept no mistakes. I can imagine the countless keynotes and books about this. Or the sales pitches.
Has always been a lie, they just understood how to hide it. Today they don't, and it's embarrassing.
Comment by e12e 2 days ago
How about the author actually reads the finished report a couple of times and checks all the references?
It really is the lowest bar - even lower maybe than running a spell check.
Comment by SpicyLemonZest 2 days ago
Comment by e12e 2 days ago
Ed: thanks for the link - I hadn't seen that yet.
Comment by palmotea 2 days ago
But then you wouldn't be embracing the new agentic ways of working!
Comment by flowbarai 2 days ago
Comment by iugtmkbdfil834 2 days ago
I kinda get it, without experience and trying, how are they to know ( unless they are already 'into it')? After all, corporate training is laughable at best.
Comment by QuadrupleA 2 days ago
Comment by overfeed 2 days ago
Comment by cryo32 2 days ago
Comment by SwuduSusuwu 2 days ago