The implementation domain of AI-901. Weighted at 55–60%, it covers building generative AI apps and agents, text and speech solutions, computer vision applications, and information extraction — all using Microsoft AI Foundry and Foundry Tools.
Implement AI Solutions by Using Microsoft Foundry domain is organised into four skill areas, each covering a distinct implementation context within Microsoft AI Foundry: generative AI apps and agents; text and speech solutions; computer vision and image generation; and information extraction using Azure Content Understanding in Foundry Tools. Questions mix scenario-based decisions, True/False statements about specific behaviours, sentence-completion questions, and Python code-completion questions, where candidates fill in a single value in a short code snippet. The table below maps every official sub-topic to the type of question you will face on it in the exam.
| Sub-topic / Area | What kind of question to expect |
|---|---|
| Create effective system and user prompts for generative AI models |
|
| Deploy a model and interact with it in the Foundry portal |
|
| Lightweight chat client application using the Foundry SDK |
|
| Create and test a single-agent solution in the Foundry portal |
|
| Lightweight client application for an agent |
|
| Text analysis application |
|
| Respond to spoken prompts by using a deployed multimodal model (Voice Live) |
|
| Build a lightweight application by using Azure Speech in Foundry Tools |
|
| Interpret visual input in prompts by using a deployed multimodal model |
|
| Create new visual outputs by using generative models |
|
| Build a lightweight application that includes vision capabilities |
|
| Extract from documents and forms by using Azure Content Understanding in Foundry Tools |
|
| Extract from images by using Content Understanding |
|
| Extract from audio and video by using Content Understanding |
|
| Build a lightweight information extraction application using Content Understanding |
|
Topic-specific questions about what AI-901 tests in Domain 02 and how to approach the key implementation areas.
Domain 02 uses four question formats: scenario-based multiple choice (a situation is described and candidates choose the correct service, tool, or configuration); True/False (a statement about a specific behaviour or feature is presented and candidates confirm whether it is correct); sentence-completion (a sentence with a blank is provided and candidates choose the word or phrase that correctly fills it); and Python code-completion (a short code snippet is shown with one or two placeholder values that candidates must identify from a list of options). Code-completion questions are more common in Domain 02 than in Domain 01, so candidates should be comfortable reading short Python snippets.
System prompts are tested through True/False questions about what they do and do not control, sentence-completion questions about what must be configured to define an agent's behaviour, and multi-select questions about the purposes of instructions when prompting a generative AI model. Candidates need a precise understanding of what system prompts govern versus what is controlled by other configuration mechanisms in Foundry.
Code-completion questions present short Python snippets that connect an application to a deployed model and ask candidates to fill in placeholder values. Candidates need to understand the relationship between the different resources involved in a Foundry deployment and know which identifier is used at each point in the connection code. Scenario questions also ask what information a client application must include when calling a deployed model.
The tool_choice parameter is tested through True/False questions about the behaviour each value produces, and through scenario questions where candidates must select the correct value to enforce a described tool-calling requirement. Candidates need to know what each possible value of this parameter causes the agent to do and be able to distinguish clearly between the options.
Azure Content Understanding is tested across multiple question formats. True/False questions test the capabilities and limitations of the service. Sentence-completion questions test processing behaviour and what defines the extraction output. Scenario questions ask candidates to select the correct analyzer type for a given content format, choose between Content Understanding and other Azure services, and identify which Azure resource must be provisioned before the service can be used. Code-completion questions test the SDK pattern for submitting content and retrieving results.
Image input is tested through sentence-completion and multi-select questions about the formats in which images can be provided via the OpenAI Responses API, True/False questions about constraints and capabilities in the Foundry playground, questions about the correct content item type to include in a request, and scenario questions about message structure when combining text and image content. Code-completion questions test the correct field values to use when sending image data from an application.
Azure Speech is tested through scenario questions that ask candidates to select the correct capability for a described requirement, code-completion questions involving the Azure Speech SDK where candidates fill in a placeholder class or method name, and questions that ask candidates to identify which SDK class is responsible for a specific part of the speech workflow. Candidates need to know the distinct capabilities within Azure Speech and understand how the SDK is structured across both synthesis and recognition scenarios.
Share how you reason through topics on this page. You can also share your feedback on this guide.