Extract insights from visual data on Azure
(AI-3008)
This 1‑day course focuses on building intelligent applications that can see, interpret, and reason over images and documents using different multimodal models and agent-based tools. Learners explore how visual and document inputs can be combined with language models to enable structured extraction, analysis, and decision-making workflows. The course emphasizes practical patterns for extracting information, orchestrating tools, and grounding model responses in visual data.
Audience Profile
This course is designed for developers, AI engineers, and technical professionals who want to build applications that work with images and documents using multimodal, agent-driven approaches. It’s best suited for learners with basic programming experience and a general understanding of cloud or AI concepts.
Prerequisites
Before starting this learning path, you should already have:
- Familiarity with Azure and Microsoft Foundry.
- Programming experience.
Course Syllabus - Extract insights from visual data on Azure
Develop a vision-enabled generative AI application
A picture says a thousand words, and multimodal generative AI models can interpret images to respond to visual prompts. Learn how to build vision-enabled chat apps.
- Introduction
- Use a vision-capable model in the Microsoft Foundry portal
- Develop a vision-based chat app
- Exercise - Develop a vision-enabled chat app
- Module assessment
- Summary
Generate images with AI
In Microsoft Foundry, you can use image generation models to create original images based on natural language prompts.
- Introduction
- What are image-generation models?
- Explore image-generation models in Microsoft Foundry portal
- Create a client application that uses an image generation model
- Exercise - Generate images with AI
- Module assessment
- Summary
Generate videos with Microsoft Foundry
Learn how to generate videos from text prompts with Sora 2 in Microsoft Foundry.
- Introduction
- Deploy a video generating model
- Generate video from a prompt
- Generate video in Python
- Exercise - Generate video with Sora 2 in Microsoft Foundry
- Module assessment
- Summary
Analyze images with Content Understanding
Learn how to analyze images with Azure Content Understanding.
- Introduction
- What is Content Understanding?
- Analyze images with Content Understanding
- Exercise - Analyze images with Content Understanding
- Module assessment
- Summary
Create a multimodal analysis solution with Azure Content Understanding
Use Azure Content Understanding for multimodal content analysis and information extraction.
- Introduction
- What is Azure Content Understanding?
- Create a Content Understanding analyzer
- Use the Content Understanding API
- Exercise - Extract information from multimodal content
- Module assessment
- Summary
Create an Azure Content Understanding client application
Use the Azure Content Understanding API for multimodal content analysis and information extraction.
- Introduction
- Prepare to use the AI Content Understanding API
- Create a Content Understanding analyzer
- Analyze content
- Exercise - Develop a Content Understanding client application
- Module assessment
- Summary
Extract data with Azure Document Intelligence
Azure Document Intelligence uses OCR and deep learning models to extract text, key-value pairs, tables, and structured data from forms and documents. Learn how to use prebuilt and custom models to automate document processing.
- Introduction
- What is Azure Document Intelligence?
- Use the Document Intelligence Studio
- Use prebuilt models
- Train and use custom models
- Exercise - Analyze documents with Document Intelligence
- Module assessment
- Summary
Create a knowledge mining solution with Azure AI Search
Unlock the hidden insights in your data with Azure AI Search. In this module, you'll learn how to implement a knowledge mining solution that extracts and enriches data, making it searchable and ready for deeper analysis.
- Introduction
- What is Azure AI Search?
- Extract data with an indexer
- Enrich extracted data with AI skills
- Search an index
- Persist extracted information in a knowledge store
- Exercise - Create a knowledge mining solution
- Module assessment
- Summary

