Salesforce develops tools to evaluate AI agents
The new benchmark, ‘CRMArena-Pro’, creates a simulated enterprise environment with synthetic data, to evaluate API calls to systems and safeguard personally identifiable information data.
Businesses can use the framework, which Salesforce said acts ‘much like a digital twin of business’, to test the accuracy and efficiency of AI agents.
The company has also launched a benchmarking tool to evaluate AI agents operating in contexts such as customer service and marketing, using metrics including accuracy, cost, speed, trust and sustainability.
Salesforce AI Research said these developments would help give businesses “the trust and tools they need to evolve into agentic enterprises”, i.e. organisations that use digital labour and AI to work alongside people.

We hope you enjoyed this article.
Research Live is published by MRS.
The Market Research Society (MRS) exists to promote and protect the research sector, showcasing how research delivers impact for businesses and government.
Members of MRS enjoy many benefits including tailoured policy guidance, discounts on training and conferences, and access to member-only content.
For example, there's an archive of winning case studies from over a decade of MRS Awards.
Find out more about the benefits of joining MRS here.
0 Comments