Mesolitica builds Malaysian large language model for Gen AI assistants on AWS

December 03, 2024 | 17:31
Amazon Web Services (AWS) announced that Mesolitica, a Malaysian startup specialising in training large language models (LLMs), has built a Malaysian language GenAI LLM on the world’s leading cloud.

The announcement was made at AWS re:Invent 2024, which is taking place in Las Vegas from December 2-6.


The MaLLaM LLM can understand local nuances such as slang and colloquialisms that merge different dialects, along with Bahasa Melayu and 16 other regional languages, for use in AI assistants across industries. Mesolitica has trained MaLLaM on 197 datasets totalling close to 200 billion tokens of publicly available Malay-specific content to provide culturally relevant AI support for applications in customer service, content generation, and data analysis in localised languages.

Using custom ML chips, including AWS Trainium and AWS Inferentia, Mesolitica cut compute costs by 87 per cent and achieved a 5.5-fold increase in throughput (transactions per second) while training MaLLaM, improving the model's responsiveness and efficiency when used for AI assistants. With AWS, Mesolitica can now deploy proofs of concept (PoCs) in as little as 24 hours, while the AWS Asia-Pacific (Malaysia) region provides a 20 per cent reduction in latency, critical for achieving human-like conversations in AI voice assistants.
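To put those percentages in concrete terms, the short Python sketch below applies them to a set of baseline figures. The baseline cost, throughput, and latency are hypothetical values chosen only for illustration, and the function name is an assumption; only the 87 per cent, 5.5-fold, and 20 per cent figures come from the announcement, which does not say how they were measured.

```python
def apply_reported_gains(baseline_cost, baseline_tps, baseline_latency_ms):
    """Apply the percentages quoted in the announcement to hypothetical baselines."""
    cost_saving = 0.87        # reported 87 per cent compute cost savings
    throughput_gain = 5.5     # reported 5.5-fold increase in throughput
    latency_reduction = 0.20  # reported 20 per cent reduction in latency
    return {
        "cost": baseline_cost * (1 - cost_saving),
        "tps": baseline_tps * throughput_gain,
        "latency_ms": baseline_latency_ms * (1 - latency_reduction),
    }


# Hypothetical baselines: 1,000 cost units, 10 transactions per second, 250 ms latency.
result = apply_reported_gains(1000.0, 10.0, 250.0)
print(f"cost: {result['cost']:.2f}")
print(f"throughput (tps): {result['tps']:.1f}")
print(f"latency (ms): {result['latency_ms']:.1f}")
```

Each figure is treated independently here; the announcement does not state whether the cost and throughput gains were measured on the same workload, so the sketch does not combine them into a single cost-per-transaction number.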

“Our biggest challenge was understanding the many local languages Malaysian patients use,” said Dr. Kev Lim, CEO and founder of health-tech startup Qmed Asia. “By leaning into Mesolitica, we can now better understand local speech patterns through MaLLaM. This has enhanced the accuracy of our medical note-taking solution, strengthened patient communication, and ultimately, empowered us to deliver higher quality healthcare.”

With AWS, Malaysian enterprises using MaLLaM can improve operations with GenAI in regional languages to help underserved audiences like farmers in rural areas make data-driven decisions using real-time weather forecasts, soil health analysis, and crop viability assessments. The Malaysian government is also exploring the integration of MaLLaM into its operations, which aligns with the country’s broader goal of AI sovereignty and local data governance.

AI assistants built on MaLLaM can provide quick, accurate responses to citizens' inquiries in multiple languages, including dialects from different Malaysian states such as Johor, Kedah, Sarawak, Selangor, and Terengganu, to ultimately improve citizen communication and data processing capabilities across the culturally diverse country. Malaysia’s educational sector can benefit from MaLLaM through applications in language learning and research, particularly in enhancing the understanding of local languages and dialects.

“With AWS, we can deploy proofs-of-concept much faster, with the right cost-effective AI compute resources and machine learning capabilities,” said Khalil Nooh, co-founder and CEO, Mesolitica. “This allows our customers to focus only on the ongoing operational costs, rather than upfront capital expenses, for their AI experiments. This is also in line with Malaysia’s national priority to develop citizen-centric applications, making our MaLLaM GenAI assistant strategically important to the country’s digital transformation ambitions.”

Southeast Asia’s population speaks about 2,300 languages. When LLMs that are pre-trained on English and Western-centric data are tasked with non-English queries, they can produce inaccuracies and misinterpretations. LLMs such as MaLLaM that are trained on culturally diverse data can address this gap, boost accuracy, and better cater for the region’s diverse cultures, ways of working, and languages. As a cloud-native platform, Mesolitica needed compute-intensive resources to develop local LLMs and meet demand from private and public sector customers across the country.

Mesolitica has significantly enhanced its machine learning operations by leveraging AWS services. The company migrated its model training workloads to Amazon Elastic Compute Cloud (Amazon EC2) and deployed inference workloads using Amazon EC2 G5 instances, which provide cost-effective GPU acceleration for AI models. To enhance its infrastructure, Mesolitica implemented Amazon Elastic Kubernetes Service (Amazon EKS) to deploy and manage ML models and applications, and to orchestrate NVIDIA GPU-based P4 instances. Furthermore, the company utilises Amazon SageMaker, a fully managed ML service, to efficiently manage and prepare large data sets essential for training LLMs.

Mesolitica is part of the AWS Asia-Pacific and Japan (APJ) Generative AI Spotlight programme, a four-week accelerator programme which aims to support early-stage startups in the region that are developing GenAI applications. The startup is also one of two Malaysian companies to receive AWS credits from the AWS Activate Programme, a comprehensive initiative that provides access to a range of resources, including AWS credits, technical support, training, and tools tailored to help startups build, launch, and scale their applications on AWS.

Mesolitica has joined the AWS Partner Network, a global programme designed by AWS to assist businesses in leveraging AWS for growth and success.

“Malaysian startups can revolutionise industries through data-driven solutions and advanced technologies,” said Pete Murray, country manager for AWS Malaysia. “For GenAI to be relevant, it must be accessible and culturally integrated. Mesolitica is creating Malaysia’s first LLM AI that’s tailored to the country’s diverse population. This has the potential to support various sectors, from improving government services to financial inclusion. We’re proud to host MaLLaM on the AWS Malaysia region.”


By Bich Thuy
