28 August 2025

Salesforce develops tools to evaluate AI agents

US – Customer relationship management (CRM) platform Salesforce’s AI research team has developed a simulation to test AI agents’ ability to perform in business scenarios.

The new benchmark, ‘CRMArena-Pro’, uses synthetic data to create a simulated enterprise environment in which API calls to systems can be evaluated and personally identifiable information safeguarded.

Businesses can use the framework, which Salesforce said acts ‘much like a digital twin of business’, to test the accuracy and efficiency of AI agents.

The company has also launched a benchmarking tool to evaluate AI agents operating in contexts such as customer service and marketing, using metrics including accuracy, cost, speed, trust and sustainability.

Salesforce AI Research said these developments would help give businesses “the trust and tools they need to evolve into agentic enterprises”, i.e. organisations that use digital labour and AI to work alongside people.

Research Live is published by MRS.
