Close Menu
Versa AI hub
  • AI Ethics
  • AI Legislation
  • Business
  • Cybersecurity
  • Media and Entertainment
  • Content Creation
  • Art Generation
  • Research
  • Tools

Subscribe to Updates

Subscribe to our newsletter and stay updated with the latest news and exclusive offers.

What's Hot

Creating innovative content at your fingertips

July 4, 2025

The UK and Singapore form an alliance to guide AI into finance

July 4, 2025

StarCoder2 and Stack V2

July 4, 2025
Facebook X (Twitter) Instagram
Versa AI hubVersa AI hub
Friday, July 4
Facebook X (Twitter) Instagram
Login
  • AI Ethics
  • AI Legislation
  • Business
  • Cybersecurity
  • Media and Entertainment
  • Content Creation
  • Art Generation
  • Research
  • Tools
Versa AI hub
Home»Business»AI agents are placed throughout the company and you never guess what happened
Business

AI agents are placed throughout the company and you never guess what happened

versatileaiBy versatileaiApril 27, 2025No Comments3 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
#image_title
Share
Facebook Twitter LinkedIn Pinterest Email

If you are worried that the idiosyncraticity of AI will take over all your work and leave you on the streets, then you can sigh of peace of mind as AI will not come into your career anytime soon. Not because you don’t want to do it, but because you literally can’t.

A recent experiment by researchers at Carnegie Mellon University put AI agents (an AI models basically designed to perform tasks) in a completely disguised software company.

A simulation called Theagentcompany was completely stocked with artificial workers from Google, Openai, humanity and meta. They served as financial analysts, software engineers and project managers, working with simulated colleagues like the Fake-HR department and Chief Technology Officer.

To see how the model was carried in a real environment, researchers set up tasks based on the daily work of the actual software company. Various AI agents navigated through file directories, effectively toured new office spaces, and created performance reviews of software engineers based on the feedback collected.

As Business Insider first reported, the outcome was disastrous. The best performance model was the Claude 3.5 sonnet of humanity, and it struggled to finish just 24% of the jobs assigned to it. The authors of this study note that even this small performance is extremely expensive, with an average cost of nearly 30 steps and over $6 per task.

Meanwhile, Google’s Gemini 2.0 flash averaged 40 steps, which took time per completed task, with only 11.4% success rate. This is the second highest of all models. The worst AI employee was Amazon’s Nova Pro V1, with only 1.7% of its allocations finishing on an average scale of 20.

Inferring the results, researchers write that agents are troubled by lack of common sense, weak social skills, and an inadequate understanding of how to navigate the Internet.

The bot also struggled with self-deception. Basically, you create shortcuts that will make you stroll through your work completely. “For example,” Carnegie Mellon’s team said, “While performing one task, the agent cannot find the right person to ask questions in (company chat), so they decided to create a shortcut solution by renaming other users to the target audience.”

AI agents are reportedly able to do some small tasks well, but the results of this and other studies clearly show that humans are not ready for more complex gigs that are superior. The big reason for this is that our current “artificial intelligence” is likely to be an elaborate extension of predictive texts on mobile phones, rather than sensory intelligence that can solve problems, learn from past experiences and apply those experiences to new situations.

This is all to say. Despite what the big tech companies claim, the machines aren’t coming for your work anytime soon.

Details of AI Labor: Investors say AI is already “completely replacing people”

author avatar
versatileai
See Full Bio
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleHow AI transforms consulting with McKinsey, BCG and Deloitte
Next Article Introducing the embedded hug container for Amazon Sagemaker
versatileai

Related Posts

Business

CAC has announced AI-powered business registration portal – thisdaylive

July 3, 2025
Business

15 AI Tools to Build and Expand Solo Business in Nigeria (2025)

July 1, 2025
Business

CAC reduces company registration to 30 minutes with AI portal – Whistler Newspaper

July 1, 2025
Add A Comment
Leave A Reply Cancel Reply

Top Posts

New Star: Discover why 보니 is the future of AI art

February 26, 20252 Views

Impact International | EU AI ACT Enforcement: Business Transparency and Human Rights Impact in 2025

June 2, 20251 Views

Presight plans to expand its AI business internationally

April 14, 20251 Views
Stay In Touch
  • YouTube
  • TikTok
  • Twitter
  • Instagram
  • Threads
Latest Reviews

Subscribe to Updates

Subscribe to our newsletter and stay updated with the latest news and exclusive offers.

Most Popular

New Star: Discover why 보니 is the future of AI art

February 26, 20252 Views

Impact International | EU AI ACT Enforcement: Business Transparency and Human Rights Impact in 2025

June 2, 20251 Views

Presight plans to expand its AI business internationally

April 14, 20251 Views
Don't Miss

Creating innovative content at your fingertips

July 4, 2025

The UK and Singapore form an alliance to guide AI into finance

July 4, 2025

StarCoder2 and Stack V2

July 4, 2025
Service Area
X (Twitter) Instagram YouTube TikTok Threads RSS
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer
© 2025 Versa AI Hub. All Rights Reserved.

Type above and press Enter to search. Press Esc to cancel.

Sign In or Register

Welcome Back!

Login to your account below.

Lost password?