Skip to content

Commit 50db6c8

Browse files
authored
Merge pull request #14699 from v-stsavell/14692
Pipeline: Update publication date for AI search skillsets article
2 parents 9f0ba91 + 82701d7 commit 50db6c8

File tree

2 files changed

+17
-15
lines changed

2 files changed

+17
-15
lines changed

docs/solution-ideas/articles/ai-search-skillsets-content.md

Lines changed: 13 additions & 11 deletions
Original file line numberDiff line numberDiff line change
@@ -4,13 +4,15 @@ This article describes how to use image processing, natural language processing,
44

55
## Architecture
66

7-
:::image type="content" alt-text="Diagram that shows the AI Search architecture to convert unstructured data into structured data." source="../media/ai-search-skillsets.svg" lightbox="../media/ai-search-skillsets.svg" border="false":::
7+
:::image type="complex" border="false" source="../media/ai-search-skillsets.svg" alt-text="Diagram that shows the AI Search architecture to convert unstructured data into structured data." lightbox="../media/ai-search-skillsets.svg":::
8+
The image has three key sections: unstructured data, AI enrichment, and knowledge store. The unstructured data section includes Blob Storage, documents, and images. The AI enrichment section includes both built-in skills and custom skills. The knowledge store section includes Blob Storage and Table Storage. Numbered steps show the flow of data. In the steps, unstructured data is ingested, enriched, indexed, projected, and queried.
9+
:::image-end:::
810

911
*Download a [Visio file](https://arch-center.azureedge.net/ai-search-skillsets.vsdx) of this architecture.*
1012

1113
### Dataflow
1214

13-
The following dataflow corresponds to the preceding diagram. The dataflow describes how the unstructured JFK Files dataset passes through the AI Search skills pipeline to produce structured and indexable data.
15+
The following dataflow corresponds to the previous diagram. The dataflow describes how the unstructured JFK Files dataset passes through the AI Search skills pipeline to produce structured and indexable data.
1416

1517
1. Unstructured data in Azure Blob Storage, such as documents and images, is ingested into AI Search.
1618

@@ -34,7 +36,7 @@ The following dataflow corresponds to the preceding diagram. The dataflow descri
3436

3537
- [Azure AI Vision](/azure/ai-services/computer-vision/overview) is a service that extracts text and visual information from images. In this architecture, it uses [text recognition](/azure/ai-services/computer-vision/overview-ocr) to extract and recognize text information from images. The [Read API](/azure/ai-services/computer-vision/overview-ocr#ocr-read-editions) uses OCR recognition models and is optimized for large, text-heavy documents and noisy images.
3638

37-
- [Azure AI Language](/azure/ai-services/language-service/overview) is a text analytics service that extracts structured information from unstructured text by using capabilities like [named entity recognition](/azure/synapse-analytics/machine-learning/overview-cognitive-services) and [key phrase extraction](/azure/search/cognitive-search-skill-keyphrases). In this architecture, Language enriches the JFK Files by identifying named entities and key phrases to support semantic search and filtering.
39+
- [Azure AI Language](/azure/ai-services/language-service/overview) is a text analytics service that extracts structured information from unstructured text by using capabilities like [named entity recognition](/azure/ai-services/language-service/named-entity-recognition/overview) and [key phrase extraction](/azure/search/cognitive-search-skill-keyphrases). In this architecture, Language enriches the JFK Files by identifying named entities and key phrases to support semantic search and filtering.
3840

3941
- [Blob Storage](/azure/well-architected/service-guides/azure-blob-storage) is a REST-based object storage solution optimized for large volumes of unstructured data. You can use Blob Storage to expose data publicly or to store application data privately. In this architecture, Blob Storage stores the original JFK Files dataset, including scanned documents and images, which are ingested into the AI enrichment pipeline.
4042

@@ -52,17 +54,17 @@ You can use AI enrichment in AI Search to extract and enhance searchable, indexa
5254

5355
The AI Search skills in this solution can be categorized into the following groups:
5456

55-
- **Image processing**: This solution uses built-in [text extraction](/azure/search/cognitive-search-concept-image-scenarios) and [image analysis](/azure/ai-services/computer-vision/overview-image-analysis?tabs=4-0) skills, including object and face detection, tag and caption generation, and celebrity and landmark identification. These skills create text representations of image content, which you can search by using the query capabilities of AI Search. *Document cracking* is the process of extracting or creating text content from nontext sources.
57+
- **Image processing:** This solution uses built-in [text extraction](/azure/search/cognitive-search-concept-image-scenarios) and [image analysis](/azure/ai-services/computer-vision/overview-image-analysis?tabs=4-0) skills, including object and face detection, tag and caption generation, and celebrity and landmark identification. These skills create text representations of image content, which you can search by using the query capabilities of AI Search. *Document cracking* is the process of extracting or creating text content from nontext sources.
5658

57-
- **Natural language processing**: This solution uses built-in skills like [entity recognition](/azure/search/cognitive-search-skill-entity-recognition), [language detection](/azure/search/cognitive-search-skill-language-detection), and [key phrase extraction](/azure/search/cognitive-search-skill-keyphrases) that map unstructured text to searchable and filterable fields in an index.
59+
- **Natural language processing:** This solution uses built-in skills like [entity recognition](/azure/search/cognitive-search-skill-entity-recognition), [language detection](/azure/search/cognitive-search-skill-language-detection), and [key phrase extraction](/azure/search/cognitive-search-skill-keyphrases) that map unstructured text to searchable and filterable fields in an index.
5860

59-
- **Custom skills**: This solution uses custom skills that extend AI Search to apply specific enrichment transformations to content. You can [specify the interface for a custom skill](/azure/search/cognitive-search-custom-skill-interface) through the [custom web API skill](/azure/search/cognitive-search-custom-skill-web-api).
61+
- **Custom skills:** This solution uses custom skills that extend AI Search to apply specific enrichment transformations to content. You can [specify the interface for a custom skill](/azure/search/cognitive-search-custom-skill-interface) through the [custom web API skill](/azure/search/cognitive-search-custom-skill-web-api).
6062

6163
### Potential use cases
6264

63-
The JFK Files [sample project](https://github.com/microsoft/AzureSearch_JFK_Files) and [online demo](https://jfk-demo-2019.azurewebsites.net/#/) presents a particular AI Search use case. This solution idea isn't intended to be a framework or scalable architecture for all scenarios. Instead, this solution idea provides a general guideline and example. The code project and demo create a public website and publicly readable storage container for extracted images, so you shouldn't use this solution with nonpublic data.
65+
The JFK Files [sample project](https://github.com/microsoft/AzureSearch_JFK_Files) and [online demo](https://jfk-demo-2019.azurewebsites.net/#/) presents a specific AI Search use case. This solution idea isn't intended to be a framework or scalable architecture for all scenarios. Instead, this solution idea provides a general guideline and example. The code project and demo create a public website and publicly readable storage container for extracted images, so you shouldn't use this solution with nonpublic data.
6466

65-
You can also use this architecture to:
67+
You can also use this architecture to perform the following actions:
6668

6769
- Increase the value and utility of unstructured text and image content in search apps and data science apps.
6870

@@ -76,13 +78,13 @@ You can also use this architecture to:
7678

7779
## Contributors
7880

79-
*This article is maintained by Microsoft. It was originally written by the following contributor.*
81+
*Microsoft maintains this article. The following contributors wrote this article.*
8082

8183
Principal author:
8284

83-
* [Carlos Alexandre Santos](https://www.linkedin.com/in/carlosafsantos) | Senior Specialized AI Cloud Solution Architect
85+
- [Carlos Alexandre Santos](https://www.linkedin.com/in/carlosafsantos) | Senior Specialized AI Cloud Solution Architect
8486

85-
*To see non-public LinkedIn profiles, sign in to LinkedIn.*
87+
*To see nonpublic LinkedIn profiles, sign in to LinkedIn.*
8688

8789
## Next steps
8890

docs/solution-ideas/articles/ai-search-skillsets.yml

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,11 +1,11 @@
11
### YamlMime:Architecture
22
metadata:
3-
title: Use AI enrichment with image and text processing
3+
title: Use AI Enrichment With Image and Text Processing
44
description: Learn how to transform unstructured image and text data into full-text searchable content with Azure AI Search prebuilt skills and custom skills.
55
author: gmndrg
66
ms.author: gimondra
7-
ms.date: 06/24/2024
8-
ms.topic: conceptual
7+
ms.date: 09/30/2025
8+
ms.topic: solution-idea
99
ms.subservice: solution-idea
1010
ms.category:
1111
- ai-machine-learning
@@ -26,4 +26,4 @@ products:
2626
- azure-functions
2727
thumbnailUrl: /azure/architecture/browse/thumbs/ai-search-skillsets.png
2828
content: |
29-
[!include[](ai-search-skillsets-content.md)]
29+
[!INCLUDE[](ai-search-skillsets-content.md)]

0 commit comments

Comments
 (0)