Description of LLMs available

Comparing models in different use cases

Stack AI offers a long list of models. Before going through the description of each one, let's discuss some key use cases and which models are preferred for them.

List of all available LLMs

GPT-4

GPT-4 is a large multimodal model (accepting text inputs and emitting text outputs today, with image inputs coming in the future) that can solve difficult problems with greater accuracy than any of OpenAI's previous models, thanks to its broader general knowledge and advanced reasoning capabilities.

GPT-3.5

GPT-3.5 is a mid-generation upgrade of GPT-3 with fewer parameters. It is fine-tuned using reinforcement learning from human feedback (RLHF), which helps improve the accuracy of its responses.

GPT-3

These models can understand and generate natural language, and were the predecessors of the GPT-3.5 models that originally powered ChatGPT. They have been superseded by the more powerful GPT-3.5 generation, but can still be used for simpler tasks where lower cost and higher speed matter.

Anthropic

Anthropic's model, Claude, is a transformer-based LLM, much like GPT-3, that leverages large-scale machine learning techniques. The model is trained on a diverse range of internet text, giving it the ability to generate text that is coherent, contextually relevant, and remarkably human-like.

Below is a comparison of the different versions of Claude.

Google

Stack AI has early access to PaLM 2, a large language model (LLM) released by Google. It is highly capable in advanced reasoning, coding, and mathematics. It is also multilingual and supports more than 100 languages. PaLM 2 is the successor to the earlier Pathways Language Model (PaLM), launched in 2022.

The two models available are the following.
