Everyday AI Podcast – An AI and ChatGPT Podcast - Ep 541: AI & Trust: When 98% accuracy won't cut it and how Sage can fix it
Episode Date: June 6, 2025Your CFO just lost sleep over a single missing penny... again.Here's the thing about finance teams: they'll hunt for days to find ONE CENT that's off in their books. Because in accounti...ng, even 98% accuracy = complete failure.So when it comes to your company's finances and AI, there's a HUGE elephant in the room: trust. Sage is changing the conversation around AI, trust and your books. Sage is a global leader in cloud-based accounting, financial management, and business management solutions. Sage CTO Aaron Harris joins the Everyday AI show to show us the new recipe for trust they're cooking up.Newsletter: Sign up for our free daily newsletterMore on this Episode: Episode PageJoin the discussion: Have a question? Join the convo here.Upcoming Episodes: Check out the upcoming Everyday AI Livestream lineupWebsite: YourEverydayAI.comEmail The Show: info@youreverydayai.comConnect with Jordan on LinkedInTopics Covered in This Episode:AI Trust Issues with Financial AccuracySage's 7-Billion Parameter Model TrainingSage Copilot's Accounting AI AccuracyTransparent Trust Labels in AI UsageFinancial Leaders' Trust in Sage's AISage's AI Factory Safety MeasuresSage's Industry Collaboration for AI AccuracyAI Implementation Strategy in AccountingTimestamps:00:00 AI Trust and Business Accuracy03:08 CFO's Role in Trust Building08:14 "Leveraging AI for Financial Growth"12:23 "Enhancing AI Trust in Finance"15:56 Early Machine Learning Infrastructure Pioneers17:53 "Sage AI Factory Overview"22:48 AI Transparency and Data Safety24:12 "Trust Label Eases Customer Evaluations"26:55 "Insights on AI and Industry"Keywords:AI trust, 98% accuracy, business leaders, Sage Future Conference, Atlanta, trust in AI, Sage Copilot, accounting software, global software company, Newcastle, North America headquarters, CFO, finance team, financial reports, forecast, budgets, credibility, financial accuracy, creative accounting, large language models, CHAT GPT, task-based AI, accounts payable automation, invoice reading, data science, AI development, neural models, conversational interface, GPT billions of predictions, generative AI, deterministic AI, billions of documents, fine-tuned models, accounting expertise, AICPA partnership, AI factory, automated machine learning, observability, model drift, hallucination detection, Sage AI factory, industry trust signals, safety mechanisms, customer by customer basis, Sage trust label, transparency labels, trustworthiness, ethical AI, responsible AI, AI safety, AI innovation, industry standards, problem-solving, financial trustworthiness.Send Everyday AI and Jordan a text message. (We can't reply back unless you leave contact info)
Transcript
Discussion (0)
This is the Everyday AI Show, the everyday podcast where we simplify AI and bring its power to your fingertips.
Listen daily for practical advice to boost your career, business, and everyday life.
Meet Firefly AI Assistant, now live in Adobe Firefly, the All In One Creative AI Studio.
Just describe what you want to create and the assistant handles the rest,
orchestrating multi-step workflows across Photoshop, Premiere Express, and more in one conversational interface.
You direct the outcome.
The assistant accelerates execution.
What if good isn't good enough, right?
I think it's something that business leaders are constantly thinking about when it comes to AI.
They're like, hey, if we get this right most of the time, let's go ahead and roll this out to the entire organization.
And sometimes that might be okay, right, if you're doing strategy, creative work, content production.
Well, what about your books?
What about your finances?
Sometimes being 90 or 95% correct could be bad.
It could be a recipe for disaster when it comes to your business's AI plan in 2025 and beyond.
That's why today I'm excited to talk a little bit about trust in AI and how, well, if you're watching our video, our live stream, you see I am at the Sage Future.
conference here in Atlanta and how Sage is really helping increase everyone's ability to trust
their AI. All right. I'm excited for this conversation. I hope you are too. What's going on,
y'all? My name's Jordan Wilson. I'm the host of Everyday AI. This is your daily live stream
podcast and free daily newsletter helping everyday business leaders like you and me, not just learn
what's happening in the world of AI, but how we can actually leverage it to grow our companies and our
careers. So make sure if you haven't already, go to our website at Your EverydayAI.com. We're going to be
recapping today's conversation and a whole lot more. But like I said, you can probably see a little
different setup here. I am in Atlanta at the SAGE Future Conference. And I'm excited to welcome our guest
for today. Aaron Harris, the CTO of SAGE. Aaron, thank you so much for joining the Everyday AI show.
Thanks for having me. Really excited to be here. Yeah, on the road in your hometown, actually.
But for, you know, for those of our audience that maybe don't know Sage, tell us what Sage is.
Sure. Yeah. So Sage is a global software company that focuses on accounting.
HR, payroll, manufacturing, sort of all of the things that a finance and accounting team needs to run the back office of the business.
So we're actually a British company.
We're headquartered in Newcastle, England.
But we've got offices all over the world.
And our headquarters here in North America is obviously here in Atlanta.
It's a company that's been around for a while.
We've been building accounting software for more than 40 years now.
The company that I co-founded that sort of got acquired in, we started more than 25 years ago.
So we've been doing this for a while.
We're not as well known in the U.S.
Because the U.S. is a huge market.
And there's sort of more players in this market.
But if you go to the U.K. or Spain or France or Germany,
like some of those countries were a bit of a household name
within the accounting industry.
Yeah.
And let's just kind of skip to the end.
When it comes to the intersection of AI in accounting,
why is sometimes being 90 or 95% accurate?
it, why does that not work?
Yeah, so, I mean, there's so many ways to address that question.
I think the first way to start is that, you know, look at the CFO in a company and the finance team,
that CFO trades on trust, right?
Their job is to create confidence not only within the business, but with stakeholders,
whether it's investors, creditors, right, that you can rely on the accuracy of the financial
reports that they're issuing. Internal stakeholders can rely on the forecasts and the budgets,
right, that are being provided. And the minute that CFO puts something out that, you know,
has a mistake in it, they're going to lose that credibility. They lose that trust. And so the bar is
incredibly high. One of the things is kind of interesting in the mindset of a CFO and a finance team.
if I'm a penny off in the basic equations of accounting, they will hunt that penny down
for days until they find it.
They may not ever give up until they find that penny.
So 99% yeah, that's not going to cut it, right?
It's got to be perfect.
Yeah.
And not only that, right?
And I'm sure many of our audience can relate to this.
Large language models by themselves, right?
So if you're using something like chat, GBT, not always the best at math, right?
Yeah, no, not at all.
all. Yeah, in fact, you know, large language models are trained. I mean, this is what makes them so
amazing. They're trained to be creative. We don't want creative accounting. And we certainly
don't want them doing math, right? So, you know, in the way that we build AI, this is something
that I often tell audiences. One of the first rules of building trusted AI is not to use AI when
traditional development will work better. So if you want AI to do math, we're going to give AI a calculator
to do that math with. And, you know, I want to get more into the, you know, I want to get more into
this piece of trust. But, you know, I saw your keynote. And when you were showing some of the
things on screen, I'm like, wait, I need that, especially when it came to Sage co-pilot. So can you
explain a little bit for maybe our audience that doesn't use Sage and your AI offerings? What the
heck? Like, how can you do that with such high accuracy and, you know, help people kind of close the
books, you know, much faster? You know, I think you said originally it was like two to three
weeks and now down to like two to three days. Yeah. Yeah. We're still out. We want to get rid of it.
Right, right. It's a relic, right? It's archaic. We want to get rid of this thing. So we think about
this in waves, right, in the way that we build and deploy AI. And the first way we call task-based
AI. This is AI that you don't necessarily see at work. So the first thing that that we built
in the world of accounts of payable automation was was AI to read and categorize an image.
boys. And my first conversations with the data science team was, okay, there's a bunch of models
right from big players that are built just to do this. Why are we not using one of those? And, you know,
what they had to convince me of was that those models just weren't good enough, right? They were about
80 percent, 75, 80 percent accurate. But in addition to that, you know, that wasn't an even,
that wasn't evenly, right, deployed accuracy. They really struggled to find the total.
on an invoice, right? They might be 30 or 40 percent accurate on that. Turns out it's kind of hard to find. And so ultimately, we had to go build our own models, models, plural, right? We've got five models just, you know, some of them are looking for the total. Some of them are checking the work of the other models to make sure that, yeah, you really did find the total. So, you know, it ended up being dozens of models. Now, you know, that the limitation of this approach is that you can't really interact with that AI. And that AI has to be very, very,
very carefully orchestrated, scripted.
It does what it's told to do exactly the way it's told to do it.
We can't really get to the next level of automation until you can interact with AI.
And so that's the big breakthrough with large language models that sit behind Sage co-pilot.
Now you can be directive of the way AI works and sort of how it gets its job done.
And so that's, you know, it's a huge breakthrough.
And it's really a change of psychology, if you will, and the way you design the product.
When we are in that task-based phase, you know, a lot of customers don't even realize how much AI is actually operating behind the scenes.
Right. They don't realize that until they're confronted with a conversational interface.
And so that, like, totally changes the way you design. You've got to design for confidence now.
Yeah. And how did we get to the point where you have a product like Sage co-pilot that can accurately, you know, take advantage of, you know,
know, the powers of generative AI, yet work in a more almost deterministic, right, like,
way. But, you know, I saw that, you know, there's been billions of predictions, millions of
documents that you've used to get here. So without, you know, going into, because I'm sure we can
talk about this part for hours. But how did we get to the point where, yes, you can feel confident
as one of the global leaders in the space to say, yeah, you can go use AI for some of your most
important financial tasks.
Yeah.
So I think there's two parts to that question or two answers to that question.
But the first, I guess, I will get back to is how do you design the product?
Right.
You sort of have to, like, you have to be credible and believable with your customers.
And so if I stood in front of, you know, our customers today and say, you know, don't worry,
it's 100% accurate.
Like, you know, we've got it.
They wouldn't believe at all, right?
So you've got to adapt the experience.
You've got to design around this understanding.
that, hey, we're going to get that large language model to be incredibly accurate.
But what we're going to do is we're going to overindex on understanding, okay, have we met that level?
And if not, you know, how do we engage a human properly to review the work?
And so that is such a huge, huge part of it.
But the other part of it is that, again, the off-the-shelf models, right, as good as they are,
as magical as they are, as powerful as they are, they're not quite good enough, right, for what we needed.
But we need a large language model that knows, like, in depth, with expertise, how our products work.
It needs to be an expert at our APIs.
Right.
So in the process of completing a task, it's probably going to write some code on the fly, right, to use an API.
And so, like, it's not going to work if it hallucinates in the process of, you know, building that API request.
And so all we found was these off-the-shelf models are pretty brilliant.
But, you know, two problems.
First, as brilliant as they are, they still make mistakes.
But second, you know, they're incredibly expensive, right?
So if we wanted to go about, you know, not just operating these models, but sort of get into the world of building one of these gigantic large-leg language models, I'd never get the budget to do that.
Right?
What, $100 million?
A lot of money to train those.
Yeah.
Trillion parameters cost a lot.
Yeah, yeah.
And so, you know, fast forward two years.
And the, you know, the efficiency.
and the capability of, you know,
fine-tuning these models has increased rapidly, right?
So costs have come down, efficiencies is increased,
but also the tools available to companies like us
have gotten better and better and better.
And so we took, you know, so GPT, as you mentioned,
trillions of parameters.
You know, GPT4, we think two trillion, probably.
We started with a model that's seven billion parameters.
Right, and we fine-tuned it from there.
And when you're fine tuning, like you can sort of like sluff off the stuff that you don't want it to do.
Right.
We train our model to like not accept toxic prompts, right?
We train it to be pleasant in the way that it interacts.
And then, you know, if the conversation is not about accounting, then we don't want to have that conversation.
So we can get it down to a 7 billion parameter model and we can fine tune it to be really, really, really good at these accounting tasks we want it to do.
You know, it's interesting because I'm sure there's a lot of people in our audience, specifically, you know, CFOs, people who work in finance that maybe their first or one of their first interactions with, you know, AI, they were probably saw a result and they're like, I'm never going to touch it again, right? Because some of the earlier, right, even if you say something like GPT4 or some of these, you know, earlier trillion parameter models, they couldn't do basic math, right? So I think even a lot of people that I talk to, they kind of wrote
off, you know, hey, we're not going to use AI in this department anymore. But it sounds like the,
one of the models that you have powering your SAGE co-pilot, I mean, it sounds like it has like a,
like a PhD and in CPA, right? Like talk a little bit more about how you were able to address
that trust issue by essentially going through and training this seven billion parameter
model to become an expert. Yeah. And I want to talk about that first experience too. But, but,
So what we've done is we've taken that base model and then we've trained in all the product
documentation and sort of loads of material around that product documentation about best
practices and how products work, you know, all the developer code.
We've trained it on accounting textbooks and accounting exams.
We've trained it on content that helps it to sort of understand and speak in the vernacular
of, you know, accountants and financial analysts.
And one of the things is super exciting that we announced today is.
is we're partnering with the AI CPA, which is the industry association that accredits CPAs.
They're now going to make their professional content available to us to train into the model.
Now, it's a proof of concept.
We're going to be conservative as we are when it comes to AI.
So I can't sit here and say, like, this is exactly what we're predicting.
But I think it's an incredible signal that, you know, the accounting industry, which, you know,
early on, like, you know, the headlines were saying accountants aren't going to exist.
I think it's incredibly interesting that the accounting industry is not just sort of embracing
AI. They're contributing to the development of AI models. But I want to kind of come back to that
first experience because it's so critical. You're absolutely right. If a CFO uses our co-pilot and
their first experience says it makes a big mistake, we won't get another chance. I grew up in Silicon
Valley, probably like a lot of people in your audience. And, you know, the mantra in Silicon Valley has
always been move fast and break things. When you're building this kind of AI in this industry,
we've got to like have a completely different culture. So I talk about, it's kind of pithy,
but like accept humility, embrace responsibility. We have to have a different mindset. We can't
rush AI to our customers. If they have that bad experience, they won't come.
back. So you've talked a little bit on how you were able to increase accuracy, right, by creating and fine-tuning
your own model, trained on everything that anyone in finance, CPA, et cetera, really cares about.
But what about on the back end? What about, you know, observability, like traceability? You know,
how does, you know, Sage co-pilot and some of the things that you announced today, specifically kind of
this trust label? Yeah. How does that address?
Yeah. So one of the things that we had to do very early is, you know, we had to build our own
infrastructure for machine learning. So I kind of like to compare this to the early days of
software as a service. So if you go back, you know, to the very earliest days, you know, all of us,
you know, Salesforce included and the other pioneers that are kind of still around, we, we had
to set out an objective for our developers that we would release a new version of the code.
on a weekly basis, we would upgrade every customer to that next version of the code automatically,
so everybody would always be on the same version of the code.
And we had to do this without disruption.
This sounds normal now.
Like 25 years ago, that was not normal.
Like, that was an incredibly provocative thing to say.
So if you fast forward, when we started building AI,
I had to give the engineers a different mission that was even more provocative, I believe.
It's like I don't want a weekly release.
I want you to automate the training of this AI and detect when it's improved enough and
then automatically update the version.
But it gets worse, Mr. Developer.
Like, you need to be able to do this on a customer by customer basis, right?
So we're going to have some big models that are trained, you know, from the collective.
But a lot of what we do, we need to train from the individual customer on a customer by
customer by customer basis. So you've got to build this infrastructure that can automate all that,
but do it in a safe way. So we built all the automation. This is why we've got tens of thousands
of models in production today. But what we also did was we built in all of these safety mechanisms,
all of these controls. So we've got systems that detect model drift, right, and launch a process
to get a data scientist involved. We've got a couple of safety mechanisms that detect hallucination.
This is where we've actually gotten some of our patents on this.
So we call this whole thing the Sage AI factory.
And if you see me talk to our customers or analysts, right, partners about AI,
I'm invariably going to talk about the Sage AI factory because I think it's so important
to understand behind the scenes like how does the factory work, right?
How does the stuff get built?
And how do I know that you're taking steps to make sure that it's safe?
So I'm curious, how many total?
organizations you have using kind of the AI co-pilot features inside SAGE right now?
So we have tens of thousands using co-pilot in various places around the world.
We started small. We started with sort of small businesses that have simple accounting needs.
We started with small accounting firms that tend to serve those businesses. And we started on sort of
the early capabilities. And over time, we've expanded it to more products, you know, more
countries, but we've also sort of gone into now more sophisticated businesses. So we launched
early access for Sage Intact at about about six or eight months ago. And I guess the thing that I
would want to reinforce here is we're being very deliberate about how we open that to more,
to more customers. We're just, we're so, we're so careful to not have that first bad
experience. And so we're going to be very deliberate in the way we rolled this out to more customers
over time. Yeah, it seems like a super strategic rollout and obviously makes sense when trust is
paramount, right? And you can't, like you said, you can't have someone have that first bad experience
where something goes wrong because the stakes are so high. You know, so I'm curious throughout the
iterations of Sage co-pilot and then, you know, obviously another wave coming with what was announced
here at Sage Future. What were maybe some of the initial?
maybe obstacles that you were able to overcome.
And then maybe what do you think is next, right?
In terms of not like, hey, what's the exact product roadmap, which I know you guys did
lay that out a little bit.
But in terms of trust, in terms of reliability, what have you guys already been able to
overcome and then what's next to overcome?
Yeah.
Yeah.
So if you'll forgive me, I'm going to tell a story.
Please do.
And I promise it's setting up for the answer.
So I've been talking about, you know, Sage intact.
and I was one of the co-founders there.
So 25 years ago, we just started the product.
We just launched the product.
Our biggest objective, the biggest obstacle to getting customers to buy the product
is that they were not willing to put their data in the cloud.
And, you know, we sort of scratch our heads at that now,
but sort of you have to imagine how new this was at the time.
And we would use all the arguments you'd expect.
We can spend more on security than you can.
Our livelihood depends on keeping your data safe.
You trust a bank with your money.
Why not trust us with your data?
It just wasn't working.
Yeah.
And so what we ultimately ended up doing, and I think it's pretty clever, we put a button
in the product that said, see my data.
And when a user clicked that button, we would pop up a window that had a webcam in the
data center pointing at the server that had their data on it.
Now, we only had one server at that time with customer data.
So, you know, this was pretty easy to do.
Then this is actually pretty awesome.
It actually kind of worked.
Thankfully, we reached a point where it was less needed when we turned it off.
Now, the reason why we turned it off, and I promise, again, true story, we had a technician
in the data center who was doing some cabling and doing some maintenance, sort of bending over
and moving around a bit.
Probably should have been wearing a better belt.
And, you know, sales rep chose at that point to click in front of a prospect, right?
You know, show my data.
And they got the full transparency.
The full transparency.
They saw the data and then some.
Right.
And so, like, this is kind of like an old story about building trust and, you know, how, you know, how transparency plays into that.
You know, today, it's still true.
If you talk to companies today and they're evaluating your software and they know that AI is a big part of the offering, they want to know that you're going to keep your data safe, right?
They're going to want to know that they can trust the technology.
And the problem with AI is the industry and the technology is moving so fast that sort of the regulatory environment around it hasn't kept up.
So we can't rely on a lot of sort of external signals, right, that you can trust the AI.
We do use some and I'll get into that.
So what we determined was, well, we need to put a button in the product.
We're going to put it in each AI feature where a user can click it.
And what's going to happen is instead of that webcam, we pop up what we call the Sage Trust
label.
Yep.
And in that trust label, this is kind of like a nutrition label, we're going to be super
transparent about, okay, what are the models that we use to build this?
Are we training our own models?
You know, if so, how do we use your data and the training of those models?
You know, what are the steps we're taking to keep your data safe?
What are the safeguards in place to defend against, you know, issues of bias or other ethical
concerns?
and we're putting it in a nice, easy-to-read format.
And if they want to learn more, we give them a button
where they can click and go out to read all of our AI commitments,
kind of solving the same problem, really.
So, yeah, we announced that today.
We're encouraging other vendors in the industry to kind of follow suit.
We can't wait for the regulatory bodies to catch up.
So we've got to plow ahead with something we think simplifies this
and sort of signals to the customer,
here's why you can trust us.
What do you think is going to be,
or maybe this is it, right?
Because I think every single company that's trying to,
you know, put out responsible,
trustworthy AI products,
there's always that hurdle, right?
With the trust label,
is this something that you think is going to be,
not like the last hurdle because there's always innovation,
there's always, you know, new capabilities,
but is this going to be one of those, you know,
those hurdles that after you get over it, you're like, wow, this made quite a difference for people
in finance to be able to trust AI in their data. I think it's going to help a lot. I mean,
I think the, you know, the first thing that it's going to do is it's going to get the CTO off
the phone. So, you know, when you get a customer who's evaluating your software, and we have
millions of customers, by the way, right? We've got 4,000 people in sales. If we get a sophisticated
customer that's got a question about AI, like, they're going to ask me to get on the phone and
talk to that customer and explain it. If we've got this trust label, it just makes it easy,
right, for people to understand and evaluate. So I think that's critical. I don't think it's going to go away.
I mean, I think it's going to evolve and change. You know, is this the moment where we solve
the issue for good? You know, is there going to be a point where I can turn off that button the
way I turned off the webcam? I don't know. But I guess we're, we're, there's a point where I can turn off the
Everything is happening so fast with AI.
It's progressing so fast.
And I think we all need to be a bit honest, right, that there's going to continue to be
lots of reasons to not trust AI, right?
It's just broader than just the accounting field.
So, you know, I think we've got to have this mindset that this isn't, you know, this is one
step in the journey.
And we're going to have to continue evaluating and looking at, you know, okay, how are people
feeling about trust in AI?
and what are the new things that are causing them to not trust it?
And we're going to have to just keep adapting as we go.
So, Aaron, we've talked about a lot in today's conversation.
So, you know, everything from trust and transparency to even how Sage is not just, you know,
building their own models, but how they're showing their work to customers on how it's being used
and how it's being implemented.
But, you know, as we wrap up, what do you think is the one most important takeaway for those people,
whether they're, you know, a CPA, whether there's a CFO,
at a huge organization, what's the one biggest takeaway that you want people to know from,
you know, Sage, Future here when it comes to trust in AI?
So I think, you know, the one thing is the future is the future that we built.
And sort of taking on the responsibility to build that future is pretty serious.
And so that's, you know, why I'm having this big conversation with you.
What we've learned through all of our conversations is the biggest signal of trust is the company
behind the AI, right?
I'm not a sophisticated person, maybe.
I don't know how to evaluate this,
but hey, that's a brand that I trust.
And it's got to change your mindset, right?
You've got to be transparent.
You've got to be credible.
You've got to be willing to admit that, hey, AI is not foolproof.
It's going to have problems.
And then you've got to make these commitments that you publish
and you've got to stand behind them.
I think that was such an insightful look into not just what's
happening here at Sage and their co-pilot and everything happening at Sage Future, but also just
the industry as a whole. I think it's important for people to hear, yes, authenticity and transparency
are just as important as productivity gains and everything else that you get from AI. So,
Aaron, thank you so much for taking time out of the very busy Sage Future conference to join us.
We really appreciate it. Thanks for giving me a chance to talk about my favorite time.
I love it. I love it. So there was a lot happening in this.
conversation. Maybe you missed a golden nugget that Aaron just dropped on us. Don't worry. We're
going to be recapping it all in our newsletter as well as everything else that was announced
here at Sage Future. And it's not just if you're a CFO or a CPA, even if you're a small,
medium-sized business owner, a lot to know that was just announced. It's all going to be in our
newsletter. So thank you for tuning in. Please join us back tomorrow. And every day for more,
Everyday AI. Thanks, y'all. Meet Firefly AI assistant. Now live in Adobe Firefly, the Allman
One Creative AI Studio.
describe what you want to create in your own words and the assistant handles the rest,
orchestrating multi-step workflows across Adobe Creative Cloud apps,
including Photoshop, Premiere Express, and more in one conversational interface.
You direct the outcome while the assistant accelerates execution.
Stand control with the ability to step in and refine at any time.
See it today at firefly.adobie.com.
And that's a wrap for today's edition of Everyday AI.
Thanks for joining us.
If you enjoyed this episode, please subscribe and leave us a rating.
It helps keep us going.
For a little more AI magic, visit Your EverydayAI.com and sign up to our daily newsletter so you don't get left behind.
Go break some barriers and we'll see you next time.
