LLM tagged posts

Adobe announces development of SLM that can Run Locally on a Phone with No Cloud Connection

app
Credit: Pixabay/CC0 Public Domain

A small team of AI researchers at Adobe Inc., working with a colleague from Auburn University and another from Georgia Tech, has developed a small language model (SLM) that they claim can be run locally on a smart phone with no access to the cloud. The group has written a paper describing their new app, which they call SlimLM, and have posted it to the arXiv preprint server.

As LLM technology continues to mature, researchers across the globe continue to find new ways to improve it. In this new effort, the research team has found a way to cut the cord for a specific type of AI application—processing documents locally.

As LLMs such as ChatGPT become more popular, users have become more worried about privacy...

Read More

DeepMind Researchers find LLMs can Serve as Effective Mediators

DeepMind researchers find LLMs can serve as effective mediators
The Habermas Machine generates high-quality group opinion statements that are preferred to human-written group statements, and critiquing provides further improvements. Credit: Science (2024). DOI: 10.1126/science.adq2852

A team of AI researchers with Google’s DeepMind London group has found that certain large language models (LLMs) can serve as effective mediators between groups of people with differing viewpoints regarding a given topic. The work is published in the journal Science.

Over the past several decades, political divides have become common in many countries—most have been labeled as either liberal or conservative...

Read More

As LLMs Grow Bigger, they’re more likely to give Wrong Answers than Admit Ignorance

As LLMs grow bigger, they're more likely to give wrong answers than admit ignorance
Performance of a selection of GPT and LLaMA models with increasing difficulty. Credit: Nature (2024). DOI: 10.1038/s41586-024-07930-y

A team of AI researchers at Universitat Politècnica de València, in Spain, has found that as popular LLMs (Large Language Models) grow larger and more sophisticated, they become less likely to admit to a user that they do not know an answer.

In their study published in the journal Nature, the group tested the latest version of three of the most popular AI chatbots regarding their responses, accuracy, and how good users are at spotting wrong answers.

As LLMs have become mainstream, users have become accustomed to using them for writing papers, poems or songs and solving math problems and other tasks, and the issue of accuracy has become a bigger...

Read More

Language Agents Help Large Language Models ‘Think’ Better and Cheaper

Language agents help large language models 'think' better and cheaper
An example of the agent producing task-specific instructions (highlighted) for a classification dataset IMDB. The agent only runs once to produce the instructions. Then, the instructions are used for all our models during reasoning. Credit: arXiv (2023). DOI: 10.48550/arxiv.2310.03710

The LLMs that have increasingly taken over the tech world are not “cheap” in many ways. The most prominent LLMs, such as GPT-4, took some $100 million to build in the form of legal costs of accessing training data, computational power costs for what could be billions or trillions of parameters, the energy and water needed to fuel computation, and the many coders developing the training algorithms that must run cycle after cycle so the machine will “learn.”

But, if a researcher needs to do a specializ...

Read More