Foundation Model
Foundation Model refers to an AI model trained at significant scale on diverse, general-purpose data and designed as a reusable base for a wide range of applications; Foundation models are pretrained using self-supervised objectives, such as next-token prediction for language or contrastive matching for vision-language; For most product teams, foundation models eliminate the need to collect training data or run GPU clusters
Foundation Model refers to an AI model trained at significant scale on diverse, general-purpose data and designed as a reusable base for a wide range of applications. Rather than building a separate model for each task, teams fine-tune or prompt a single foundation model to handle translation, summarization, code generation, image understanding, and more.
How it works
Foundation models are pretrained using self-supervised objectives, such as next-token prediction for language or contrastive matching for vision-language. The pretraining phase requires enormous compute and data; the resulting model encodes general knowledge that downstream tasks exploit through fine-tuning, few-shot prompting, or retrieval augmentation.
Key facts
- Origin: The term was introduced in a 2021 Stanford CRFM paper highlighting emergent risks and capabilities from training at scale.
- Modalities: Foundation models now cover text, images, audio, video, and code.
- Adaptation methods: Teams adapt them via fine-tuning, LoRA, prompt engineering, or RAG rather than training from scratch.
- Providers: OpenAI, Anthropic, Google, Meta, Mistral, and Cohere are primary providers of publicly accessible foundation models.
For builders
For most product teams, foundation models eliminate the need to collect training data or run GPU clusters. The engineering work shifts to prompt design, retrieval pipelines, and evaluation rather than model training. Selecting a foundation model involves assessing context length, modality support, licensing, and hosted inference cost.
Sources
- Vaswani, A., et al. (2017). Attention Is All You Need. arXiv:1706.03762. arxiv.org
- Brown, T., et al. (2020). Language Models are Few-Shot Learners (GPT-3). arXiv:2005.14165. arxiv.org
- Bommasani, R., et al. (2021). On the Opportunities and Risks of Foundation Models. Stanford CRFM. arxiv.org
- NIST. (2023). AI Risk Management Framework (AI RMF 1.0). nist.gov
- Stanford HAI. Foundation Models research portal. hai.stanford.edu