Announcing a renaissance in pc imaginative and prescient AI with Microsoft’s Florence basis mannequin | Azure Blog and Updates

0
916
Announcing a renaissance in pc imaginative and prescient AI with Microsoft’s Florence basis mannequin | Azure Blog and Updates


Extract strong insights from picture and video content material with Azure Cognitive Service for Vision

We are happy to announce the general public preview of Microsoft’s Florence basis mannequin, educated with billions of text-image pairs and built-in as cost-effective, production-ready pc imaginative and prescient providers in Azure Cognitive Service for Vision. The improved Vision Services permits builders to create cutting-edge, market-ready, accountable pc imaginative and prescient functions throughout varied industries. Customers can now seamlessly digitize, analyze, and join their information to pure language interactions, unlocking highly effective insights from their picture and video content material to help accessibility, drive acquisition by means of search engine optimisation, shield customers from dangerous content material, improve safety, and enhance incident response occasions.

Microsoft was just lately named a Leader within the IDC MarketScape: Worldwide General-Purpose Computer Vision AI Software Platforms 2022 Vendor Assessment (doc #US49776422, November 2022). The new Vision Services improves content material discoverability with automated captioning, sensible cropping, classifying, background elimination, and looking for photographs. Furthermore, customers can monitor actions, analyze environments, and obtain real-time alerts with accountable AI controls. 

Reddit can be utilizing Vision Services to generate captions for lots of of tens of millions of photographs on its platform. Tiffany Ong, Reddit Product Manager of Consumer Product has mentioned,

“With Microsoft’s Vision know-how, we’re making it simpler for customers to find and perceive our content material. The newly created picture captions make Reddit extra accessible for everybody and provides redditors extra alternatives to discover our photographs, interact in conversations, and in the end construct connections and a way of group.”

Microsoft is harnessing the facility of the brand new Vision Services in Microsoft 365 apps like Teams, PowerPoint, Outlook, Word, Designer, OneDrive, along with the Microsoft Datacenter. Microsoft Teams is driving innovation within the digital area with the assistance of segmentation capabilities, taking digital conferences to the subsequent degree. PowerPoint, Outlook, and Word leverage picture captioning for automated alt-text to enhance accessibility. Microsoft Designer and OneDrive are utilizing improved picture tagging, picture search, and background era to simplify picture discoverability and modifying. Microsoft Datacenters are leveraging Vision Services to boost safety and infrastructure reliability.

At this week’s Microsoft Ability Summit, firms will find out how they’ll enhance the accessibility of their visible content material. We’ll share the way forward for our Seeing AI app and LinkedIn will share the advantages of using Vision Services to ship automated alt-text descriptions for picture evaluation. As a preview, Jennison Asuncion, LinkedIn’s Head of Accessibility Engineering Evangelism has mentioned,

“More than 40 percent of LinkedIn’s feed posts include at least one image. We want every member to have equal access to opportunity and are committed to ensuring that we make images accessible to our members who are blind or who have low vision so they can be a part of the online conversation. With Azure Cognitive Service for Vision, we can provide auto-captioning to edit and support alt. text descriptions. I’m excited about this new experience because now, not only will I know my colleague shared a picture from an event they attended, but that my CEO Ryan Roslansky is also in the picture.”

Try out the brand new out-of-the-box options our clients are utilizing in Vision Studio:

  • Dense captions: Automatically ship wealthy captions, design ideas, accessible alt-text, search engine optimisation optimization, and clever photograph curation to help digital content material.
  • Image retrieval: Improve search suggestions and commercials with pure language queries that seamlessly measure the similarity between photographs and textual content.

  • Background elimination: Transform the feel and appear of photographs by simply segmenting individuals and objects from their unique background, changing them with a most popular background scene.
  • Model customization: Lower prices and time to ship customized fashions that match distinctive enterprise calls for at excessive precision, and with only a handful of photographs.
  • Video summarization (Video TL;DR): Search and work together with video content material in the identical intuitive means you suppose and write. Locate related content material with out the necessity for extra metadata.

Innovate responsibly

Review the accountable AI ideas to find out how we’re dedicated to growing AI programs that assist make the world extra accessible. We are centered on serving to organizations take full benefit of AI, and we’re investing closely in packages that present know-how, sources, and experience to empower these working to create a extra sustainable, secure, and accessible world.

Get began at present with Azure Cognitive Service for Vision

Revolutionize your pc imaginative and prescient functions with improved effectivity, accuracy, and accessibility in picture and video processing, on the identical low value. Visit Vision Studio to check out our newest demos.

Learn extra about Azure Cognitive Service for Vision:

LEAVE A REPLY

Please enter your comment!
Please enter your name here