simple@lemmy.world to

Technology@lemmy.worldEnglish · 1 year ago

KOSMOS-2: Microsoft's New AI Breakthrough Generating Text, Images, Video & Sound in Real-Time! (July 7, 2023)

4

81

KOSMOS-2: Microsoft's New AI Breakthrough Generating Text, Images, Video & Sound in Real-Time! (July 7, 2023)

simple@lemmy.world to

Technology@lemmy.worldEnglish · 1 year ago

4

cross-posted from: https://lemmy.world/post/1134694

KOSMOS-2: Microsoft’s New AI Breakthrough Generating Text, Images, Video & Sound in Real-Time!

Microsoft has unveiled its latest AI breakthrough, KOSMOS-2, which can generate text, images, video, and sound in real-time[1]. This multimodal large language model (MLLM) is grounded in the real world through its ability to understand and analyze image content[4]. It was trained using large-scale data of grounded image-text pairs called GrIT[2].

KOSMOS-2 is a significant step forward in AI technology, with its ability to generate content across multiple modalities[6]. It has the potential to revolutionize computer vision applications with improved efficiency, accuracy, and accessibility in image and video processing[3].

This breakthrough is a testament to Microsoft’s commitment to advancing AI technology and its potential to transform industries across the board. We can’t wait to see what the future holds with KOSMOS-2!

Citations: [1] https://youtube.com/watch?v=VxsqtoytLsA [2] https://www.microsoft.com/en-us/research/publication/kosmos-2-grounding-multimodal-large-language-models-to-the-world/ [3] https://azure.microsoft.com/en-us/blog/announcing-a-renaissance-in-computer-vision-ai-with-microsofts-florence-foundation-model/ [4] https://arstechnica.com/information-technology/2023/03/microsoft-unveils-kosmos-1-an-ai-language-model-with-visual-perception-abilities/ [5] https://www.linkedin.com/posts/trishuhl_generativeai-multimodal-ai-activity-7040590986057564160-ImJp [6] https://www.cjco.com.au/article/news/unleashing-the-power-of-kosmos-2-a-leap-forward-in-ai-tech-with-grounded-multimodal-language-models/

Chat

Impulsivedoorholder
link
fedilink
English
arrow-up
4·
1 year ago
502 still, but thanks for the link! This is super interesting. I’m curious to see where it goes.

Technology@lemmy.world

technology@lemmy.world

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !technology@lemmy.world

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related content.
Be excellent to each another!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, to ask if your bot can be added please contact us.
Check for duplicates before posting, duplicates may be removed

Approved Bots

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

2.92K users / day
8.6K users / week
17.7K users / month
39.3K users / 6 months
449 local subscribers
55.7K subscribers
9.87K Posts
447K Comments
Modlog