Close Menu
Versa AI hub
  • AI Ethics
  • AI Legislation
  • Business
  • Cybersecurity
  • Media and Entertainment
  • Content Creation
  • Art Generation
  • Research
  • Tools

Subscribe to Updates

Subscribe to our newsletter and stay updated with the latest news and exclusive offers.

What's Hot

Business owners are seeking approval for a new hookah lounge and beer service in Massachusetts

July 15, 2025

Military AI contract awarded to humanity, Openai, Google and Xai

July 15, 2025

CHATGPT: Top AI Productivity Tools for Streamlined Creator Workflows

July 15, 2025
Facebook X (Twitter) Instagram
Versa AI hubVersa AI hub
Tuesday, July 15
Facebook X (Twitter) Instagram
Login
  • AI Ethics
  • AI Legislation
  • Business
  • Cybersecurity
  • Media and Entertainment
  • Content Creation
  • Art Generation
  • Research
  • Tools
Versa AI hub
Home»Research»Sony researchers propose Talkhier: a new AI framework for LLM-MA systems that address key challenges in communication and improvement
Research

Sony researchers propose Talkhier: a new AI framework for LLM-MA systems that address key challenges in communication and improvement

By February 23, 2025No Comments4 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
Share
Facebook Twitter LinkedIn Pinterest Email

LLM-based multi-agent (LLM-MA) systems allow multiple language model agents to collaborate on complex tasks by splitting their responsibilities. These systems are used in robotics, finance and coding, but face the challenges of communication and refinement. Text-based communication leads to long, unstructured exchanges, making it difficult to track tasks, maintain structure and remember past interactions. Improvements such as discussion and feedback-based improvements fight because the processing order can cause important input to be ignored or biased. These issues limit the efficiency of LLM-MA systems in handling multi-step problems.

Currently, LLM-based multi-agent systems use discussion, self-healing, and multi-agent feedback to handle complex tasks. These techniques are not structured based on text-based interactions and become difficult to control. Agents struggle to follow subtasks, recall previous interactions, and provide consistent responses. Various communication structures, including chain and tree-based models, attempt to improve efficiency, but there is no explicit protocol for structuring the information. Feedback – Repair techniques try to improve accuracy, but biased and overlapping inputs create challenges and unreliable evaluations. Without systematic communication and large-scale feedback, such systems are still inefficient and error prone.

To mitigate these issues, researchers at Sony Group Corporation in Japan have introduced Talkhier, a framework that uses structured protocols and hierarchical refinement to improve communication and task coordination in multi-agent systems. I proposed it. Unlike standard approaches, Talkhier increasingly nuances explain the interaction of agent-task formulations, reducing errors and efficiency. Agents perform formalized roles, and scaling is automatically adapted to different problems across systems, resulting in improved decision-making and coordination.

This framework configures the agents in the graph such that each node is an agent and the edge represents the communication path. The agent owns independent memory. This allows you to retain relevant information and make decisions based on the informational input without using shared memory. Communication follows a formal process. The message includes content, background information, and intermediate output. Agents are teamed up with supervisors who are monitoring the process, with a subset of agents acting as members and supervisors, resulting in a nested hierarchy. The work is assigned, evaluated and improved in a series of iterations until passing a quality threshold, with the goal of minimizing accuracy and errors.

Once assessed, researchers evaluated Talkhier across multiple benchmarks to analyze its effectiveness. In the MMLU dataset covering moral scenarios, university physics, machine learning, formal logic, and US foreign policy, Talkhier, built on the GPT-4o, achieves the highest accuracy of 88.38%. The Agentverse (83.66%) and the single symbolic baseline has surpassed React-7@ (67.19%) and GPT-4O-7@ (71.15%) show the advantages of hierarchical refinement. In the wikiqa dataset, it is superior to the baseline for open domain question answers with a Rouge-1 score of 0.3461 (+5.32%) and 0.6079 (+3.30%). Ablation studies showed that removing evaluation supervisors or structured communications significantly reduced accuracy and confirmed its importance. Talkhier outperformed by 17.63% in character count violations against camera datasets of fidelity, flow ency, charm, and ad text-generated, human ratings validated multi-agent ratings. While the internal architecture of Openai-O1 has not been revealed, Talkhier has posted competitive MMLU scores, crucially beaten on Wikiqa, giving it more flexibility than a majority vote and open source multi-agent system It has been shown.

Ultimately, the proposed framework improves communication, inference, and coordination of LLM multi-agent systems by combining structured protocols and hierarchical refinement, resulting in better performance on several benchmarks. Ta. Structured interactions were guaranteed without sacrificing heterogeneous agent feedback, including messages, interim results, and contextual information. Even with the increased API costs, Talkhier has set up a new benchmark for scalable and objective multi-agent cooperation. This methodology serves as a baseline for subsequent research, directing improved effective communication mechanisms and low-cost multi-agent interactions, and ultimately towards advances in LLM-based cooperative systems. .

Please see the paper and the github page. All credits for this study will be directed to researchers in this project. Also, feel free to follow us on Twitter. Don’t forget to join 75K+ ML SubredDit.

Committed read-lg lg ai Research releases Nexus: an advanced system that integrates agent AI systems and data compliance standards to address legal concerns in AI datasets

Divyesh is a consulting intern at MarkTechPost. He pursues Btech in agriculture and food engineering at Indian Institute of Technology, Haragpur. He is a data science and machine learning enthusiast who wants to integrate these key technologies into the agriculture domain and solve challenges.

Commended open source AI platform recommended: “Intelagent is an open source multiagent framework for evaluating complex conversational AI systems” (promotion)

author avatar
See Full Bio
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleSaudi Arabia’s Media Forum opens with a focus on AI
Next Article Researchers at Moonshot AI and UCLA will release a 3B/16B parameter mixture of exper (MOE) model trained with 5.7T tokens using Muon Optimizer.

Related Posts

Research

People are beginning to sound like AI, research shows

July 13, 2025
Research

IIT has launched an MRI research facility to promote innovation and AI integration

July 13, 2025
Research

Research shows that artificial intelligence (AI) coding aids do not increase productivity.

July 12, 2025
Add A Comment
Leave A Reply Cancel Reply

Top Posts

Data and AI Status: Security and Privacy

July 12, 20251 Views

Will AI apps help carry the mental load of moms?

May 8, 20251 Views

The UAE announces bold AI-led plans to revolutionize the law

April 22, 20251 Views
Stay In Touch
  • YouTube
  • TikTok
  • Twitter
  • Instagram
  • Threads
Latest Reviews

Subscribe to Updates

Subscribe to our newsletter and stay updated with the latest news and exclusive offers.

Most Popular

Data and AI Status: Security and Privacy

July 12, 20251 Views

Will AI apps help carry the mental load of moms?

May 8, 20251 Views

The UAE announces bold AI-led plans to revolutionize the law

April 22, 20251 Views
Don't Miss

Business owners are seeking approval for a new hookah lounge and beer service in Massachusetts

July 15, 2025

Military AI contract awarded to humanity, Openai, Google and Xai

July 15, 2025

CHATGPT: Top AI Productivity Tools for Streamlined Creator Workflows

July 15, 2025
Service Area
X (Twitter) Instagram YouTube TikTok Threads RSS
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer
© 2025 Versa AI Hub. All Rights Reserved.

Type above and press Enter to search. Press Esc to cancel.

Sign In or Register

Welcome Back!

Login to your account below.

Lost password?