Classify and route your data using Content Understanding Studio

Content Understanding Studio enables you to create custom classification workflows that route your data to the right custom analyzer. With routing, you can send several different data streams through the same pipeline and have each input routed to the analyzer best suited to it.

Prerequisites

To get started, make sure you have the following resources and permissions:

  • An Azure subscription. If you don't have an Azure subscription, create a free account.
  • Once you have your Azure subscription, create a Microsoft Foundry resource in the Azure portal. Be sure to create it in a supported region.
    • This resource is listed under Foundry > Foundry in the portal.
  • Set up default model deployments for your Content Understanding resource. Setting defaults creates a connection between Content Understanding and the Foundry models in your Foundry resource that are used for Content Understanding requests. To set the defaults:
    1. Go to the Content Understanding settings page.
    2. Select the "+ Add resource" button in the upper left.
    3. Select the Foundry resource that you want to use, select Next, and then select Save.
      • Leave "Enable autodeployment for required models if no defaults are available." checked. This option ensures that your resource is fully set up with the required GPT-4.1, GPT-4.1-mini, and text-embedding-3-large models. Different prebuilt analyzers require different models.

Sign in to Content Understanding Studio

Go to the Content Understanding Studio portal and sign in with your credentials. You might recognize the classic Azure Document Intelligence in Foundry Tools Studio experience. Content Understanding extends the content and field extraction that you're familiar with from Document Intelligence to all modalities: document, image, video, and audio. Select the option to try out the new Content Understanding experience to get the full multimodal capabilities of the service.

Create your custom categories

Custom categories let you route your data to a specific analyzer so that you get the best output for each type of data. In this guide, you learn how to classify invoice documents based on the client that provided them. Documents from different clients might be structured differently, depending on the business agreement in place. This classification workflow ensures that each document is analyzed with the correct context. To route your data successfully, you might first need to create the custom analyzers that your categories route to. For more information on building custom analyzers, see Create and improve your custom analyzer in Content Understanding Studio.
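As a mental model only, a routing rule pairs a category name and description with the analyzer that handles that category. The following Python sketch illustrates that pairing. It isn't how the service stores or evaluates rules, and every name in it (RoutingRule, RULES, analyzer_for, the category names, and the analyzer IDs) is hypothetical.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class RoutingRule:
    """Hypothetical stand-in for one category defined in the Studio."""
    category: str      # category name
    description: str   # text that describes which data belongs here
    analyzer_id: str   # custom analyzer that handles this category


# Hypothetical rules: one custom invoice analyzer per client.
RULES = [
    RoutingRule("contoso_invoice", "Invoices issued by Contoso", "contoso-invoice-analyzer"),
    RoutingRule("fabrikam_invoice", "Invoices issued by Fabrikam", "fabrikam-invoice-analyzer"),
]


def analyzer_for(category: str, rules: Sequence[RoutingRule],
                 default: Optional[str] = None) -> Optional[str]:
    """Return the analyzer ID for a classified category, or `default` if no rule matches."""
    for rule in rules:
        if rule.category == category:
            return rule.analyzer_id
    return default
```

In this sketch, the classification step decides the category and the lookup decides which analyzer receives the document. In the Studio, both parts are configured together when you add a category.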

  1. Start with a new project: To get started with creating your custom classification workflow, select Create project on the home page.

  2. Select your project type: For this scenario, select the option to Classify and route with custom categories. To learn more about creating custom analyzers for content and field extraction, see Create and improve your custom analyzer in Content Understanding Studio.

  3. Upload your data: To get started with classifying, upload a piece of sample data.

  4. Create routing rules: Under the Routing rules tab, select Add category. Give the category a name and description, and select an analyzer to correspond to that route. For example, if you're analyzing invoices from multiple clients that each require custom schemas, you can route to the custom analyzer that was built for that specific invoice type. The tool allows you to preview the schema for each analyzer to ensure you have the right one.

    Screenshot of routes UX for classification.

  5. Test your classification workflow: When your custom routing rules are ready for testing, select Run analysis to see how the rules classify your data. You can optionally upload more sample data to see how the workflow performs across multiple rules.

    Screenshot of Content Understanding Studio with the Test button highlighted.

  6. Build your classification analyzer: When you’re satisfied with the output, select the Build analyzer button at the top of the page. Give the analyzer a name and select Save.

  7. Use your classification analyzer: Now you have an analyzer endpoint that you can use in your own application via the REST API.
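Calling the analyzer from an application could look like the following minimal Python sketch. It rests on several assumptions that this article doesn't state: the API version string, the ":analyze" route, the "inputs" request body shape, the Ocp-Apim-Subscription-Key header, and the Operation-Location polling pattern all reflect the publicly documented Content Understanding REST API as understood at the time of writing. Check them against the REST reference for the API version you use. The function names, endpoint, analyzer name, and key are placeholders.

```python
import json
import time
import urllib.parse
import urllib.request

# Assumption: adjust to the API version your resource supports.
API_VERSION = "2025-11-01"


def build_analyze_request(endpoint, analyzer_id, key, document_url,
                          api_version=API_VERSION):
    """Build (but don't send) the POST request that starts an analysis."""
    url = (
        f"{endpoint.rstrip('/')}/contentunderstanding/analyzers/"
        f"{urllib.parse.quote(analyzer_id, safe='')}:analyze"
        f"?api-version={urllib.parse.quote(api_version, safe='')}"
    )
    # Assumption: the body shape differs between API versions.
    body = json.dumps({"inputs": [{"url": document_url}]}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            "Ocp-Apim-Subscription-Key": key,
            "Content-Type": "application/json",
        },
    )


def analyze(endpoint, analyzer_id, key, document_url,
            poll_seconds=2.0, max_polls=60):
    """Start an analysis, then poll until it reaches a terminal status."""
    request = build_analyze_request(endpoint, analyzer_id, key, document_url)
    with urllib.request.urlopen(request) as response:
        # Assumption: the service returns the polling URL in this header.
        operation_url = response.headers["Operation-Location"]
    if not operation_url:
        raise RuntimeError("Response had no Operation-Location header")

    poll = urllib.request.Request(
        operation_url, headers={"Ocp-Apim-Subscription-Key": key}
    )
    for _ in range(max_polls):
        with urllib.request.urlopen(poll) as response:
            result = json.loads(response.read())
        if str(result.get("status", "")).lower() in ("succeeded", "failed", "canceled"):
            return result
        time.sleep(poll_seconds)
    raise TimeoutError("Analysis did not finish within the polling budget")
```

A call such as `analyze("https://<your-resource-endpoint>", "<your-analyzer-name>", "<your-key>", "https://<host>/invoice.pdf")` would return the parsed JSON result. Building the request separately from sending it keeps the URL and payload easy to inspect without making a network call.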
