Salesforce develops tools to evaluate AI agents

US – Customer relationship management (CRM) platform Salesforce’s AI research team has developed a simulation to test AI agents’ ability to perform in business scenarios.

chatbot using laptop

The new benchmark, ‘CRMArena-Pro’, creates a simulated enterprise environment with synthetic data, to evaluate API calls to systems and safeguard personally identifiable information data.

Businesses can use the framework, which Salesforce said acts ‘much like a digital twin of business’, to test the accuracy and efficiency of AI agents.

The company has also launched a benchmarking tool to evaluate AI agents operating in contexts such as customer service and marketing, using metrics including accuracy, cost, speed, trust and sustainability.

Salesforce AI Research said these developments would help give businesses “the trust and tools they need to evolve into agentic enterprises”, i.e. organisations that use digital labour and AI to work alongside people.

We hope you enjoyed this article.
Research Live is published by MRS.

The Market Research Society (MRS) exists to promote and protect the research sector, showcasing how research delivers impact for businesses and government.

Members of MRS enjoy many benefits including tailoured policy guidance, discounts on training and conferences, and access to member-only content.

For example, there's an archive of winning case studies from over a decade of MRS Awards.

Find out more about the benefits of joining MRS here.

0 Comments


Display name

Email

Join the discussion

Newsletter
Stay connected with the latest insights and trends...
Sign Up
Latest From MRS

Our latest training courses

Our new 2025 training programme is now launched as part of the development offered within the MRS Global Insight Academy

See all training

Specialist conferences

Our one-day conferences cover topics including CX and UX, Semiotics, B2B, Finance, AI and Leaders' Forums.

See all conferences

MRS reports on AI

MRS has published a three-part series on how generative AI is impacting the research sector, including synthetic respondents and challenges to adoption.

See the reports

Progress faster...
with MRS 
membership

Mentoring

CPD/recognition

Webinars

Codeline

Discounts