Announcing a renaissance in computer vision AI with Microsoft’s Florence foundation model | Azure Blog and Updates


Extract robust insights from image and video content with Azure Cognitive Service for Vision

We are pleased to announce the public preview of Microsoft’s Florence foundation model, trained with billions of text-image pairs and integrated as cost-effective, production-ready computer vision services in Azure Cognitive Service for Vision. The improved Vision Services enable developers to create cutting-edge, market-ready, responsible computer vision applications across various industries. Customers can now seamlessly digitize, analyze, and connect their data to natural language interactions, unlocking powerful insights from their image and video content to support accessibility, drive acquisition through SEO, protect users from harmful content, enhance security, and improve incident response times.

Microsoft was recently named a Leader in the IDC MarketScape: Worldwide General-Purpose Computer Vision AI Software Platforms 2022 Vendor Assessment (doc #US49776422, November 2022). The new Vision Services improve content discoverability with automatic captioning, smart cropping, classifying, background removal, and image search. Furthermore, users can track movements, analyze environments, and receive real-time alerts with responsible AI controls.

Reddit will be using Vision Services to generate captions for hundreds of millions of images on its platform. Tiffany Ong, Reddit Product Manager of Consumer Product, has said,

“With Microsoft’s Vision technology, we are making it easier for users to discover and understand our content. The newly created image captions make Reddit more accessible for everyone and give redditors more opportunities to explore our images, engage in conversations, and ultimately build connections and a sense of community.”

Microsoft is harnessing the power of the new Vision Services in Microsoft 365 apps like Teams, PowerPoint, Outlook, Word, Designer, and OneDrive, in addition to the Microsoft Datacenter. Microsoft Teams is driving innovation in the digital space with the help of segmentation capabilities, taking virtual meetings to the next level. PowerPoint, Outlook, and Word leverage image captioning for automatic alt-text to improve accessibility. Microsoft Designer and OneDrive are using improved image tagging, image search, and background generation to simplify image discoverability and editing. Microsoft Datacenters are leveraging Vision Services to enhance security and infrastructure reliability.

At this week’s Microsoft Ability Summit, companies will learn how they can improve the accessibility of their visual content. We’ll share the future of our Seeing AI app, and LinkedIn will share the benefits of using Vision Services to deliver automatic alt-text descriptions for image analysis. As a preview, Jennison Asuncion, LinkedIn’s Head of Accessibility Engineering Evangelism, has said,

“More than 40 percent of LinkedIn’s feed posts include at least one image. We want every member to have equal access to opportunity and are committed to ensuring that we make images accessible to our members who are blind or who have low vision so they can be a part of the online conversation. With Azure Cognitive Service for Vision, we can provide auto-captioning to edit and support alt. text descriptions. I’m excited about this new experience because now, not only will I know my colleague shared a picture from an event they attended, but that my CEO Ryan Roslansky is also in the picture.”

Try out the new out-of-the-box features our customers are using in Vision Studio:

  • Dense captions: Automatically deliver rich captions, design suggestions, accessible alt-text, SEO optimization, and intelligent photo curation to support digital content.
  • Image retrieval: Improve search recommendations and advertisements with natural language queries that seamlessly measure the similarity between images and text.
  • Background removal: Transform the look and feel of images by easily segmenting people and objects from their original background, replacing them with a preferred background scene.
  • Model customization: Lower costs and time to deliver custom models that match unique business demands at high precision, and with just a handful of images.
  • Video summarization (Video TL;DR): Search and interact with video content in the same intuitive way you think and write. Locate relevant content without the need for additional metadata.
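
To give a feel for what "measuring the similarity between images and text" can look like, here is a minimal sketch. It assumes that a text query and each image have already been mapped into the same vector space by some embedding model. The vectors, file names, and the `rank_images` helper are hypothetical placeholders made up for illustration, not output from or part of the Vision Services API, and cosine similarity is one common choice of measure rather than a confirmed detail of the service.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat an all-zero vector as matching nothing
    return dot / (norm_a * norm_b)


def rank_images(query_vector, image_vectors):
    """Sort (name, score) pairs from most to least similar to the query."""
    scored = [
        (name, cosine_similarity(query_vector, vector))
        for name, vector in image_vectors.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical 3-dimensional embeddings; real models use far more dimensions.
IMAGE_VECTORS = {
    "beach.jpg": [0.9, 0.1, 0.0],
    "office.jpg": [0.1, 0.8, 0.3],
    "dog.jpg": [0.0, 0.2, 0.9],
}

# Hypothetical embedding of a query such as "people on a sunny beach".
QUERY_VECTOR = [0.8, 0.2, 0.1]

ranking = rank_images(QUERY_VECTOR, IMAGE_VECTORS)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

Because the query and the images share one vector space, the same ranking step works whether the query is a sentence or another image.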

Innovate responsibly

Review the responsible AI principles to learn how we are committed to developing AI systems that help make the world more accessible. We are focused on helping organizations take full advantage of AI, and we are investing heavily in programs that provide technology, resources, and expertise to empower those working to create a more sustainable, safe, and accessible world.

Get started today with Azure Cognitive Service for Vision

Revolutionize your computer vision applications with improved efficiency, accuracy, and accessibility in image and video processing, at the same low price. Visit Vision Studio to try out our latest demos.

Learn more about Azure Cognitive Service for Vision:
