Revolutionizing AI with Self-Critiquing Frameworks
Recent advancements in artificial intelligence (AI) have sparked discussions about innovative approaches to improve user interaction. One notable development is the Critic-RM, a self-critiquing AI framework designed to enhance reward modeling and human preference alignment within large language models (LLMs).

Critic-RM aims to overcome the limitations of traditional reward models by generating critiques alongside scalar scores, providing richer feedback signals for the optimization process of LLM outputs. Challenges remain, particularly in integrating critiques into reward models due to differing objectives and resource demands.
Advancements in AI Applications
Developed by researchers from GenAI, Meta, and Georgia Institute of Technology, Critic-RM enhances reward model training through a two-stage process that generates critiques with discrete scores, ensuring quality aligned with human preferences. This framework has shown a significant improvement in reward model accuracy—between 3.7% and 7.3%—on various benchmarks.
Cultural Relevance in AI: Malaysia’s MaLLaM
Meanwhile, in Malaysia, Mesolitica has unveiled the MaLLaM generative AI model, tailored to understand local languages and dialects. This initiative is part of the country’s strategy to create citizen-centric applications that address the needs of diverse communities.
MaLLaM is trained on a massive dataset, incorporating 197 datasets totaling nearly 200 billion tokens of Malay-specific content, allowing it to provide accurate support in customer service, data analysis, and more.
Dr. Kev Lim, CEO of healthtech start-up Qmed Asia, shared that leveraging MaLLaM’s capabilities has significantly enhanced the accuracy of their medical note-taking solutions, ultimately improving communication with patients.
AI’s Future and Societal Impact
Both Critic-RM and MaLLaM exemplify the future of AI technologies: personalized user experiences and contextual understanding. These innovations not only streamline operations but also empower underserved communities to access AI-driven solutions in their preferred languages.
The collaboration between technology and local languages signifies the ongoing evolution of AI, focusing on inclusivity, efficiency, and elevated engagement.
Conclusion
As AI continues to integrate more deeply into various sectors, frameworks like Critic-RM and models like MaLLaM demonstrate the potential of AI to improve functionality and user interaction, paving the way for smarter and more community-oriented solutions.
- 0 Comments
- Critic-RM
- Language Understanding
- Localisation
- MaLLaM