Peering Into The Black Box of Large Language Models

Large Language Models (LLMs) can produce extremely human-like communication, but their inner workings are something of a mystery. Not a mystery in the sense that we don’t know how an LLM works, but a mystery in the sense that the exact process of turning a particular input into a particular output is something of a black box.

This “black box” trait is common to neural networks in general, and LLMs are very deep neural networks. It is not really possible to explain precisely why a specific input produces a particular output, and not something else.

Why? Because neural networks are neither databases nor lookup tables. In a neural network, the activation of an individual neuron cannot be meaningfully mapped to a specific concept or word. The connections are complex, numerous, and multidimensional to the point that trying to tease out their relationships in any straightforward way simply does not make sense.

Neural Networks are a Black Box

In a way, this shouldn’t be surprising. After all, the entire umbrella of “AI” is about using software to tackle the sorts of problems for which humans generally struggle to write an explicit program. It’s maybe no wonder that the end product has some level of inscrutability.

This isn’t what most of us expect from software, but as humans we can relate to the black box aspect more than we might realize. Take, for example, the process of elegantly translating a phrase from one language to another.

To illustrate this, I’d like to borrow an idea from an article by Lance Fortnow in Quanta Magazine about the ubiquity of computation in our world. Lance asks us to imagine a woman named Sophie who grew up speaking French and English and works as a translator. Sophie can easily take any English text and produce a sentence of equivalent meaning in French. Sophie’s brain follows some kind of process to perform this conversion, but Sophie likely doesn’t understand the entire process. She might not even think of it as a process at all. It’s something that just happens. Sophie, like most of us, is intimately familiar with black box functionality.

The difference is that while many of us (perhaps grudgingly) accept this aspect of our own existence, we are understandably dissatisfied with it as a feature of our software. New research has made progress towards changing this.

Identifying Conceptual Features in Language Models

We know perfectly well how LLMs work, but that doesn’t help us pick apart individual transactions. Opening the black box while it’s working yields only a mess of discrete neural activations that cannot be meaningfully mapped to particular concepts, words, or whatever else. Until now, that is.

A small sample of features activated when an LLM is prompted with questions such as “What is it like to be you?” and “What’s going on in your head?” (source: Extracting Interpretable Features from Claude 3 Sonnet)

Recent developments have made the black box much less opaque, thanks to tools that can map and visualize LLM internal states during computation. This creates a conceptual snapshot of what the LLM is — for lack of a better term — thinking in the process of putting together its response to a prompt.

Anthropic have recently shared details on their success in mapping the mind of their Claude 3 Sonnet model by finding a way to match patterns of neuron activations to concrete, human-understandable concepts called features.

A feature can be just about anything: a person, a place, an object, or something more abstract like the idea of upper case, or function calls. An activated feature does not necessarily factor directly into the output, but its activation does mean it played some role in the path the output took.
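
To make the idea concrete, here is a minimal sketch of how a feature might be read off a pattern of activations rather than a single neuron. It assumes a sparse-autoencoder-style encoder (the general approach described in the Scaling Monosemanticity paper), in which each feature is scored as a rectified weighted sum over all neurons. The weights, biases, neuron values, and the function name `feature_activations` are invented for illustration; only the feature labels echo the examples above.

```python
# Toy illustration only: every number below is made up, not taken from Claude.

def relu(x: float) -> float:
    """Clip negative scores to zero so a feature is either off or positive."""
    return max(0.0, x)

def feature_activations(neurons, encoder, biases):
    """Score each feature as ReLU(dot(encoder_row, neurons) + bias)."""
    scores = {}
    for name, row in encoder.items():
        pre_activation = sum(w * n for w, n in zip(row, neurons)) + biases[name]
        scores[name] = relu(pre_activation)
    return scores

# Hypothetical encoder: each feature reads from several neurons at once.
ENCODER = {
    "upper_case": [1.0, 0.0, 1.0, 0.0],
    "function_call": [0.0, 1.0, 0.0, -1.0],
}
BIASES = {"upper_case": -0.5, "function_call": -0.5}

neurons = [0.75, 0.25, 0.5, 0.25]  # hypothetical activations of four neurons
acts = feature_activations(neurons, ENCODER, BIASES)
print(acts)  # {'upper_case': 0.75, 'function_call': 0.0}
```

No single neuron "is" the upper-case feature here; the feature only shows up when the first and third neurons fire together strongly enough to clear the bias.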

With a way to map groups of activations to features — a significant engineering challenge — one can meaningfully interpret the contents of the black box. It is also possible to measure a sort of relational “distance” between features, and therefore get an even better idea of what a given state of neural activation represents in conceptual terms.
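
One plausible way to picture this "distance" is cosine similarity between the direction vectors associated with each feature. The sketch below assumes that measure and uses invented three-dimensional vectors; the feature names, the `DIRECTIONS` table, and the helper `nearest_features` are hypothetical and are not taken from Anthropic's tooling.

```python
import math

def cosine_similarity(a, b):
    """Return the cosine of the angle between two vectors (1.0 = same direction)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def nearest_features(target, directions):
    """Rank every other feature by similarity to `target`, closest first."""
    others = [
        (name, cosine_similarity(directions[target], vec))
        for name, vec in directions.items()
        if name != target
    ]
    return sorted(others, key=lambda pair: pair[1], reverse=True)

# Hypothetical feature directions, chosen so that two of them nearly align.
DIRECTIONS = {
    "upper_case": [1.0, 0.0, 0.0],
    "title_case": [0.8, 0.6, 0.0],
    "function_call": [0.0, 0.0, 1.0],
}

ranking = nearest_features("upper_case", DIRECTIONS)
for name, similarity in ranking:
    print(f"{name}: {similarity:.2f}")
```

In this toy setup the two casing features land close together while the unrelated one sits at a right angle, which is the sort of neighborhood structure that helps put a conceptual label on a given activation state.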

Making Sense of it all

One way this can be used is to produce a heat map that highlights how heavily different features were involved in Claude’s responses. Artificially manipulating the weighting of different concepts changes Claude’s responses in predictable ways (video), demonstrating that the features are indeed reasonably accurate representations of the LLM’s internal state. More details on this process are available in the paper Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet.
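
The manipulation experiment can be sketched in miniature. The example below assumes steering works by adding a multiple of a feature's direction to the activation vector and then reading out the best-matching word; the paper's actual procedure differs in detail, and all vectors, words, and function names here (`steer`, `pick_word`) are hypothetical.

```python
# Toy illustration only: all vectors and words below are invented.

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def steer(neurons, direction, strength):
    """Push the activations along a feature direction by `strength`."""
    return [n + strength * d for n, d in zip(neurons, direction)]

def pick_word(neurons, readout):
    """Return the word whose readout vector best matches the activations."""
    return max(readout, key=lambda word: dot(readout[word], neurons))

READOUT = {"hello": [1.0, 0.0], "HELLO": [0.0, 1.0]}  # hypothetical output layer
UPPER_CASE = [0.0, 1.0]                               # hypothetical feature direction

neurons = [1.0, 0.25]
before = pick_word(neurons, READOUT)
after = pick_word(steer(neurons, UPPER_CASE, 2.0), READOUT)
print(before, "->", after)  # hello -> HELLO
```

If boosting a feature labeled "upper case" reliably shifts the output toward capitals, that is evidence the label describes something real about the internal state, which is the logic behind the steering demonstrations.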

Mapping the mind of a state-of-the-art LLM like Claude may be a nontrivial undertaking, but that doesn’t mean the process is entirely the domain of tech companies with loads of resources. Inspectus by labml.ai is a visualization tool that works similarly to provide insight into the behavior of LLMs during processing. There is a tutorial on using it with a GPT-2 model, but don’t let that turn you off. GPT-2 may be older, but it is still relevant.

Research like this offers new ways to understand (and potentially manipulate, or fine-tune) these powerful tools, making LLMs more transparent and more useful, especially in applications where lack of operational clarity is hard to accept.

PlayARTi

ARTi is a fun AI image creator that lets you choose a character, location, and activity for your image. After choosing your options using the three buttons, you can click on “Create!” to get your unique image. PlayARTi is designed to be super easy to use, almost like a point-and-click adventure game!


Unicorn Platform AI

Unicorn Platform is an AI-powered website and landing page builder designed specifically for indie makers, startups, and SaaS businesses. This tool allows you to enter in an idea, and create a fully-functional and stunning website in just a few minutes. There are several templates to choose from, and the AI will take into account your […]


Janitor AI

Janitor AI is an exciting new platform that offers a wide range of interactive AI chatbots. You can explore various categories of chatbots, all of which are created by other members of the community. Janitor AI has a toggle for turning on SFW and NSFW content, and also has options for showing the most popular […]


Quora Launches AI Chatbot Platform Called Poe

Quora, the Q&A website, has just launched a new invite-only platform called Poe. It’s currently only available on iOS, but it’s set to be released across more platforms soon. Poe is a place where you can ask questions and have a conversation with AI chatbots. You can chat with different AI models separately within the […]


Melobytes

Melobytes is a suite of AI tools for generating music and sound. There are a ton of different features which can help musicians gain inspiration, and even without prior music experience you’ll be able to generate music tracks from text or other prompts. One of the coolest tools on there is the ability to generate […]


Prompt Engineering With ChatGPT and GPT-4

Category – Prompt Engineering, GPT
Course Difficulty – Easy
Course Length – 2 Hours 11 Minutes
Price – Requires Skillshare Subscription
Rating – 4.5/5

This course aims to teach learners how to use ChatGPT and GPT-4 effectively to go beyond average results and unlock the true potential of AI technologies. The course is […]


OverflowAI

OverflowAI is a new set of AI-powered products and features being added to Stack Overflow’s public platform and Stack Overflow for Teams. The goal is to leverage AI like semantic search and natural language processing to enhance the developer experience, while still keeping the Stack Overflow community at the center. OverflowAI is expected to be […]


novita.ai

novita.ai gives you access to 100+ APIs, including AI image generation & editing with 10,000+ models, and training APIs for custom models. The platform utilizes cheap pay-as-you-go pricing, freeing you from GPU maintenance hassles while building your own products. novita.ai also offers a Playground where you can run and test different image generator models […]


Voicemod

Voicemod features an AI-powered voice changer with seven voice filters that use artificial intelligence to allow you to transform your voice in real-time, giving you the ability to create different personas for your online interactions. These voices include options such as a pilot, astronaut, and male and female voices, and are accessible through Voicemod’s Voicebox […]


MeetGeek

MeetGeek is an AI meeting assistant that provides a smarter way to manage and analyze your meetings. It records, transcribes, summarizes, and provides insights, freeing up your calendar and allowing you to focus on meaningful conversations. With advanced features and simple usability, MeetGeek can help you uncover blind spots, capture essential information, keep your team […]


Fireflies.ai

Fireflies.ai helps your team to record, transcribe, search, and analyze meetings and conversations. With AI-powered search, you can review a 1 hour meeting in 5 minutes and find key metrics. The tool also provides conversation intelligence by tracking speaker talk time and sentiment. Fireflies integrates with various apps like Slack, Notion, and Asana, and creates […]


HARPA AI

HARPA AI is an AI agent in the form of a Chrome extension. With HARPA, you’re able to integrate ChatGPT into Google Search, automate websites, write text, track product prices, and much more. HARPA also offers page-aware GPT prompts for various fields such as Marketing, SEO, Copywriting, HR, and Engineering. This tool can save time […]


NSFW Character AI

NSFW Character AI lets you remove restrictions and create and chat to unique NSFW characters. You can create your dream characters and watch them come to life in a metaverse of possibilities – with unfiltered NSFW AI chat. Your character can also learn while it chats to you, improving the dialogue feedback for truly personalized […]


Creativz

Creativz is an AI image generator which lets you create stunning product shots anytime, anywhere. Suitable for businesses and individuals looking to professionally showcase their products. The tool currently has two main features, AI HeroShot and DreamShoot. HeroShot lets you upload an image of your product and turn it into an eye-catching hero shot using […]


Mockey

Mockey is a free AI mockup generator that provides a suite of tools for AI mockup generation, AI photography, AI image generation, AI background removal, and more. It includes mockups for clothes, phone covers, posters, mugs, and over 1000 other templates. To use the tool, simply upload your designs in PNG or JPG format and […]


Stable Beluga 2

Stable Beluga 2 is a new open-source LLM developed by Stability AI, based on the 70-billion-parameter Llama 2 model by Meta AI. This LLM is currently leading the chart on Hugging Face’s Open LLM Leaderboard. Like most other LLMs, you’ll need an interface installed to run Stable Beluga 2 on […]


LastMile AI

LastMile AI is an AI developer platform that allows engineers to rapidly prototype and productionize generative AI applications without needing to be machine learning experts. It provides a collaborative notebook-style environment to access leading generative AI models like GPT-4, GPT-3.5, Stable Diffusion and more through a single interface.


Lensa

Lensa is one of the most popular AI tools for creating your own digital portrait pictures. Lensa allows you to perform facial retouching with the tap of a button, fix facial imperfections, replace or blur out backgrounds, apply unique filters and special effects and lots more. The app is available on both iOS and Android […]

