Edit

Share via


Select Azure PaaS solutions for AI

This article explains how to select resources for Azure AI platform as a service (PaaS) solutions. The following table summarizes the main Azure AI PaaS solutions and key decision criteria.

AI services AI type Description Skills required
Microsoft Foundry Generative AI and nongenerative AI A platform for building and deploying generative and nongenerative AI applications Developer and data science skills
Azure AI services Generative AI and nongenerative AI Various services that provide prebuilt generative and nongenerative AI models Developer skills
Azure OpenAI Generative AI A service for accessing OpenAI models Developer and data science skills
Azure Machine Learning Machine learning A service for training and deploying machine learning models Developer skills and advanced data science skills

Select resources for generative AI workloads

Generative AI uses multiple resources to process input data and produce meaningful outputs. To build effective applications, such as those that use retrieval-augmented generation (RAG), select resources that ground AI models and deliver accurate results.

Diagram that shows the basic components of a generative AI workload.

Generative AI workflow

The following workflow matches the diagram above:

  1. The AI app receives a user query.
  2. An orchestrator, such as Foundry Agent Service or the Microsoft Agent Framework, manages data flow.
  3. A search and retrieval mechanism finds the grounding data.
  4. The mechanism sends the grounding data to a generative AI platform.
  5. The generative AI platform creates a response using the user query and grounding data.

Generative AI resource selection

Follow these steps to build generative RAG workloads:

  1. Select a generative AI platform. Use Foundry or Azure OpenAI to deploy and manage generative AI models. Foundry offers a code-first platform with built-in tools for development, deployment, and orchestration. Choose Azure OpenAI if you only need access to OpenAI models.

  2. Select an AI compute type. Foundry requires compute instances for specific features. Pick a compute type that fits your performance and budget needs.

  3. Select an orchestrator. Use orchestrators like Foundry Agent Service or Microsoft Agent Framework to manage data flow and interactions. If your workload uses multiple agents, make sure your orchestrator supports the AI agent orchestration patterns you need.

  4. Select a search and knowledge retrieval mechanism. To ground generative AI models, create an index or vector database for relevant data. Use Azure AI Search to build traditional and vector indexes from various data sources, apply data chunking, and use multiple query types. For structured databases, consider Azure Cosmos DB, Azure Database for PostgreSQL, or Azure Managed Redis.

  5. Select a data source for grounding data. Store grounding data in Azure Blob Storage for images, audio, video, or large datasets. You can also use databases supported by AI Search or vector databases.

  6. Select a compute platform. Use the Azure compute decision tree to choose the right platform for your workload.

Select resources for nongenerative AI workloads

Nongenerative AI workloads use platforms, compute resources, data sources, and data processing tools to support machine learning tasks. Select resources that help you build AI workloads with prebuilt or custom solutions.

Diagram that shows the basic components of a nongenerative AI workload.

Nongenerative AI workflow

The following workflow matches the diagram above:

  1. The AI app ingests incoming data.
  2. An optional data processing mechanism extracts or transforms the data.
  3. An AI model endpoint analyzes the data.
  4. You can use the data for training or fine-tuning AI models.

Nongenerative AI resource selection

Follow these steps to build nongenerative AI workloads:

  1. Select a nongenerative AI platform. Use AI services or Machine Learning based on your needs. AI services offer prebuilt models that simplify deployment and reduce the need for advanced data science skills. Machine Learning lets you develop custom models with your data and integrate them into your workloads.

  2. Select an AI compute type. Machine Learning requires compute resources to run jobs or host endpoints. Pick a compute type that fits your performance and budget needs. AI services do not require compute resources.

  3. Select a data source. Use supported data sources to host training data for Machine Learning. Many AI services do not require fine-tuning data. Some AI services, such as Azure AI Custom Vision, let you upload local files to managed storage.

  4. Select a compute platform. Use the Azure compute decision tree to choose the right platform for your workload.

  5. Select a data processing service (optional). Use Azure Functions to process serverless data. Use Azure Event Grid to trigger data processing pipelines.

Next steps