Microsoft Bing now lets you upload photos and answers your complex questions [2023-10-12 17:45:31]

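I tried image recognition on an interior construction scene, and I am very satisfied with the results!
Microsoft Bing image recognition, applied to interior design critique!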

Multimodal generative AI

Multimodal AI: the basics
Let’s start with modes. Think of a mode as something like a human sense. You might both see and taste a carrot, for instance. You would identify that you were eating a carrot faster than if you had to eat it blindfolded. You could also identify the carrot if you could see it but not taste it. If it were not carrot-shaped (e.g. a puree), you might still guess it was carrot from the colour, and if you could also eat that puree, the flavour would confirm it. That’s multimodal AI in a nutshell: it combines different kinds of input so that the learning system can infer a more accurate result than it could from any single input.
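The carrot analogy can be sketched as a toy calculation. This is only an illustration under stated assumptions, not how Bing or GPT-4 actually works: the function name `fuse`, the labels, and all the probabilities are made up, and the two "senses" are assumed to be independent so that their scores can simply be multiplied and renormalized.

```python
# Toy "late fusion" sketch: combine what each sense believes about the food.
# Assumption: the senses are independent, so per-label scores can be multiplied.

def fuse(*modalities):
    """Multiply per-label probabilities across modalities, then renormalize."""
    labels = modalities[0].keys()
    scores = {label: 1.0 for label in labels}
    for probs in modalities:
        for label in labels:
            scores[label] *= probs[label]
    total = sum(scores.values())
    return {label: score / total for label, score in scores.items()}

# Hypothetical numbers for an orange puree: the colour alone is ambiguous,
# the flavour is a much stronger hint.
sight = {"carrot": 0.6, "pumpkin": 0.4}
taste = {"carrot": 0.9, "pumpkin": 0.1}

fused = fuse(sight, taste)
print(fused)
```

With these made-up numbers, the fused belief in "carrot" comes out higher than either sight or taste gives on its own, which is the point of the analogy: two partial hints together are more convincing than one.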

Microsoft's new Bing search now incorporates GPT-4 technology: recognizing images and handling complex search interactions is no problem


Asia University, Department of Interior Design, first-year class