日韩中文字幕在线一区二区三区,亚洲热视频在线观看,久久精品午夜一区二区福利,精品一区二区三区在线观看l,麻花传媒剧电影,亚洲香蕉伊综合在人在线,免费av一区二区三区在线,亚洲成在线人视频观看
          首頁(yè) 500強(qiáng) 活動(dòng) 榜單 商業(yè) 科技 商潮 專題 品牌中心
          雜志訂閱

          《寶可夢(mèng)GO》玩家拍下300億張實(shí)景照片,如今用來(lái)訓(xùn)練機(jī)器人送披薩

          Catherina Gioino
          2026-03-25

          Niantic Spatial的視覺(jué)定位系統(tǒng)VPS解決了長(zhǎng)期阻礙自動(dòng)配送行業(yè)發(fā)展的問(wèn)題。

          文本設(shè)置
          小號(hào)
          默認(rèn)
          大號(hào)
          Plus(0條)

          《寶可夢(mèng)GO》玩家拍攝現(xiàn)實(shí)世界的照片,構(gòu)建了迄今為止最詳細(xì)的地圖。圖片來(lái)源:Mike Coppola/Getty Images

          街角隨處可見(jiàn)皮卡丘,進(jìn)道館前先升級(jí),還有“去投票站抓寶可夢(mèng)”活動(dòng)……你一定記得那個(gè)時(shí)代,《寶可夢(mèng)GO》引發(fā)狂熱,上億人為捕捉稀有的亞特諾姆或特別版噴火龍走上街頭。如今看來(lái),《寶可夢(mèng)GO》不僅風(fēng)靡全球,還利用眾包數(shù)據(jù)繪制出世界地圖。

          過(guò)去10年里,《寶可夢(mèng)GO》玩家主動(dòng)上傳各地公共地標(biāo)、街角、店面和城市十字路口的照片和短視頻,最終匯聚成了包含300億張街景實(shí)拍圖像的數(shù)據(jù)集,覆蓋全球幾乎所有主要城市。企業(yè)級(jí)人工智能與地圖部門Niantic Spatial從Niantic公司拆分,耗時(shí)多年將這批海量數(shù)據(jù)轉(zhuǎn)化為機(jī)器人行業(yè)前所未見(jiàn)的成果:專為機(jī)器人打造的照片級(jí)真實(shí)感,街道級(jí)精度,可持續(xù)更新的物理世界模型。

          目前這一模型已用于Coco Robotics旗下約1000輛配送機(jī)器人的導(dǎo)航。這些機(jī)器人在全美及全球多個(gè)城市運(yùn)營(yíng),包括洛杉磯、芝加哥、邁阿密、澤西城和芬蘭赫爾辛基,迄今已完成數(shù)百萬(wàn)英里的配送任務(wù)。Niantic Spatial首席技術(shù)官,也是谷歌地球創(chuàng)始團(tuán)隊(duì)成員之一布萊恩·麥克倫登清晰解釋了個(gè)中數(shù)據(jù)策略。

          “我們將玩家數(shù)據(jù)當(dāng)成高質(zhì)量的地面訓(xùn)練數(shù)據(jù),用來(lái)優(yōu)化其他質(zhì)量較低的數(shù)據(jù)集,”麥克倫登給《財(cái)富》的一份聲明中表示,“Niantic Spatial長(zhǎng)期理念是,利用高度集中的地點(diǎn)訓(xùn)練模型,從而解決定位、重建和語(yǔ)義理解等難題,然后利用更廣泛可用但分辨率較低的數(shù)據(jù),實(shí)現(xiàn)從‘劣質(zhì)’數(shù)據(jù)中完成定位、可視化和理解。”

          300億張《寶可夢(mèng)GO》圖像不僅僅是一張地圖,更是創(chuàng)建現(xiàn)實(shí)世界實(shí)時(shí)地圖的萬(wàn)能鑰匙。玩家的掃描向模型展示“精確”的含義。模型精確到甚至能在輸入數(shù)據(jù)不完美時(shí)及時(shí)提醒。這一策略使 Niantic Spatial的定位不再僅僅是轉(zhuǎn)型的游戲公司,而是有史以來(lái)最宏大的地圖測(cè)繪行動(dòng),完全由用戶捕捉數(shù)字生物的熱情資助的項(xiàng)目。

          Niantic Spatial的視覺(jué)定位系統(tǒng)VPS解決了長(zhǎng)期阻礙自動(dòng)配送行業(yè)發(fā)展的問(wèn)題。多數(shù)導(dǎo)航系統(tǒng)的核心是全球定位系統(tǒng) (GPS),在高樓林立的城市環(huán)境中表現(xiàn)不佳,因?yàn)楦邔咏ㄖ?huì)干擾衛(wèi)星信號(hào)。送餐機(jī)器人的目標(biāo)是將食物精準(zhǔn)配送到特定門口,幾米誤差就可能導(dǎo)致顧客投訴漢堡變涼,或者送錯(cuò)到鄰居家門口。相比之下,視覺(jué)定位系統(tǒng)完全繞過(guò)了衛(wèi)星,將機(jī)器人攝像頭的實(shí)時(shí)畫(huà)面與海量圖像數(shù)據(jù)庫(kù)比對(duì),實(shí)現(xiàn)實(shí)時(shí)定位。

          “模型將實(shí)時(shí)接收來(lái)自機(jī)器人的圖像,將其與公開(kāi)數(shù)據(jù)集以及專有數(shù)據(jù)集比對(duì),確定機(jī)器人的全球位置和航向,”Niantic Spatial一位發(fā)言人在給《財(cái)富》的聲明中表示。該公司清楚這項(xiàng)技術(shù)在何處表現(xiàn)最佳:“Niantic Spatial 的視覺(jué)定位系統(tǒng)在GPS表現(xiàn)不佳的城市峽谷中尤為穩(wěn)定可靠。”

          “最初的視覺(jué)定位系統(tǒng)依托用戶在游戲中主動(dòng)選擇拍攝的掃描數(shù)據(jù)構(gòu)建,但模型并不會(huì)依賴單一數(shù)據(jù)源,”該發(fā)言人說(shuō)道。玩家參與始終自愿,必須主動(dòng)選擇提交特定公共地標(biāo)的視頻。如今,該模型逐漸通過(guò)Niantic Spatial企業(yè)客戶自行生成的數(shù)據(jù)學(xué)習(xí)。其底層引擎是一個(gè)大型地理空間模型,通過(guò)數(shù)十億張姿態(tài)圖像和數(shù)億次現(xiàn)實(shí)世界掃描的訓(xùn)練后,已具備三大能力:將空間重建為可導(dǎo)航的3D模型,在空間內(nèi)定位機(jī)器,以及在語(yǔ)義上理解環(huán)境。正如首席執(zhí)行官約翰·漢克在近期一篇博客文章中所寫:“過(guò)去幾年,我們一直在構(gòu)建大型地理空間模型,相當(dāng)于鮮活的世界地圖,天生服務(wù)于機(jī)器人和人工智能的地圖。”

          在Coco首席執(zhí)行官扎克·拉什看來(lái),機(jī)器人(缺乏)批判思維能力是問(wèn)題所在。

          “機(jī)器人沒(méi)有人類的直覺(jué),人類可以理解‘我的GPS不太管用,但大概知道該往哪走’,”拉什告訴《財(cái)富》,“我們需要機(jī)器人也有那種直覺(jué)。”

          “進(jìn)入高樓林立的密集區(qū)域時(shí),視覺(jué)定位系統(tǒng)解決方案作用就非常大,”拉什說(shuō),“那種環(huán)境下,GPS和現(xiàn)有解決方案可能會(huì)失效。”

          他指出,配送的最后一刻會(huì)直接影響顧客體驗(yàn):“如果機(jī)器人在錯(cuò)的地方傻等,顧客體驗(yàn)會(huì)很糟糕。”

          “與Niantic Spatial合作還處于早期階段,但能跟如此優(yōu)秀的團(tuán)隊(duì)協(xié)作,探索如何將技術(shù)融入現(xiàn)有技術(shù)以提升服務(wù)質(zhì)量,我們都覺(jué)得很興奮。視覺(jué)定位系統(tǒng)顯然是理想選擇,”拉什繼續(xù)說(shuō)道,“他們能力非常強(qiáng)。如果送餐時(shí)定位更準(zhǔn)確,就能讓顧客滿意。”(財(cái)富中文網(wǎng))

          譯者:梁宇

          審校:夏林

          街角隨處可見(jiàn)皮卡丘,進(jìn)道館前先升級(jí),還有“去投票站抓寶可夢(mèng)”活動(dòng)……你一定記得那個(gè)時(shí)代,《寶可夢(mèng)GO》引發(fā)狂熱,上億人為捕捉稀有的亞特諾姆或特別版噴火龍走上街頭。如今看來(lái),《寶可夢(mèng)GO》不僅風(fēng)靡全球,還利用眾包數(shù)據(jù)繪制出世界地圖。

          過(guò)去10年里,《寶可夢(mèng)GO》玩家主動(dòng)上傳各地公共地標(biāo)、街角、店面和城市十字路口的照片和短視頻,最終匯聚成了包含300億張街景實(shí)拍圖像的數(shù)據(jù)集,覆蓋全球幾乎所有主要城市。企業(yè)級(jí)人工智能與地圖部門Niantic Spatial從Niantic公司拆分,耗時(shí)多年將這批海量數(shù)據(jù)轉(zhuǎn)化為機(jī)器人行業(yè)前所未見(jiàn)的成果:專為機(jī)器人打造的照片級(jí)真實(shí)感,街道級(jí)精度,可持續(xù)更新的物理世界模型。

          目前這一模型已用于Coco Robotics旗下約1000輛配送機(jī)器人的導(dǎo)航。這些機(jī)器人在全美及全球多個(gè)城市運(yùn)營(yíng),包括洛杉磯、芝加哥、邁阿密、澤西城和芬蘭赫爾辛基,迄今已完成數(shù)百萬(wàn)英里的配送任務(wù)。Niantic Spatial首席技術(shù)官,也是谷歌地球創(chuàng)始團(tuán)隊(duì)成員之一布萊恩·麥克倫登清晰解釋了個(gè)中數(shù)據(jù)策略。

          “我們將玩家數(shù)據(jù)當(dāng)成高質(zhì)量的地面訓(xùn)練數(shù)據(jù),用來(lái)優(yōu)化其他質(zhì)量較低的數(shù)據(jù)集,”麥克倫登給《財(cái)富》的一份聲明中表示,“Niantic Spatial長(zhǎng)期理念是,利用高度集中的地點(diǎn)訓(xùn)練模型,從而解決定位、重建和語(yǔ)義理解等難題,然后利用更廣泛可用但分辨率較低的數(shù)據(jù),實(shí)現(xiàn)從‘劣質(zhì)’數(shù)據(jù)中完成定位、可視化和理解。”

          300億張《寶可夢(mèng)GO》圖像不僅僅是一張地圖,更是創(chuàng)建現(xiàn)實(shí)世界實(shí)時(shí)地圖的萬(wàn)能鑰匙。玩家的掃描向模型展示“精確”的含義。模型精確到甚至能在輸入數(shù)據(jù)不完美時(shí)及時(shí)提醒。這一策略使 Niantic Spatial的定位不再僅僅是轉(zhuǎn)型的游戲公司,而是有史以來(lái)最宏大的地圖測(cè)繪行動(dòng),完全由用戶捕捉數(shù)字生物的熱情資助的項(xiàng)目。

          Niantic Spatial的視覺(jué)定位系統(tǒng)VPS解決了長(zhǎng)期阻礙自動(dòng)配送行業(yè)發(fā)展的問(wèn)題。多數(shù)導(dǎo)航系統(tǒng)的核心是全球定位系統(tǒng) (GPS),在高樓林立的城市環(huán)境中表現(xiàn)不佳,因?yàn)楦邔咏ㄖ?huì)干擾衛(wèi)星信號(hào)。送餐機(jī)器人的目標(biāo)是將食物精準(zhǔn)配送到特定門口,幾米誤差就可能導(dǎo)致顧客投訴漢堡變涼,或者送錯(cuò)到鄰居家門口。相比之下,視覺(jué)定位系統(tǒng)完全繞過(guò)了衛(wèi)星,將機(jī)器人攝像頭的實(shí)時(shí)畫(huà)面與海量圖像數(shù)據(jù)庫(kù)比對(duì),實(shí)現(xiàn)實(shí)時(shí)定位。

          “模型將實(shí)時(shí)接收來(lái)自機(jī)器人的圖像,將其與公開(kāi)數(shù)據(jù)集以及專有數(shù)據(jù)集比對(duì),確定機(jī)器人的全球位置和航向,”Niantic Spatial一位發(fā)言人在給《財(cái)富》的聲明中表示。該公司清楚這項(xiàng)技術(shù)在何處表現(xiàn)最佳:“Niantic Spatial 的視覺(jué)定位系統(tǒng)在GPS表現(xiàn)不佳的城市峽谷中尤為穩(wěn)定可靠。”

          “最初的視覺(jué)定位系統(tǒng)依托用戶在游戲中主動(dòng)選擇拍攝的掃描數(shù)據(jù)構(gòu)建,但模型并不會(huì)依賴單一數(shù)據(jù)源,”該發(fā)言人說(shuō)道。玩家參與始終自愿,必須主動(dòng)選擇提交特定公共地標(biāo)的視頻。如今,該模型逐漸通過(guò)Niantic Spatial企業(yè)客戶自行生成的數(shù)據(jù)學(xué)習(xí)。其底層引擎是一個(gè)大型地理空間模型,通過(guò)數(shù)十億張姿態(tài)圖像和數(shù)億次現(xiàn)實(shí)世界掃描的訓(xùn)練后,已具備三大能力:將空間重建為可導(dǎo)航的3D模型,在空間內(nèi)定位機(jī)器,以及在語(yǔ)義上理解環(huán)境。正如首席執(zhí)行官約翰·漢克在近期一篇博客文章中所寫:“過(guò)去幾年,我們一直在構(gòu)建大型地理空間模型,相當(dāng)于鮮活的世界地圖,天生服務(wù)于機(jī)器人和人工智能的地圖。”

          在Coco首席執(zhí)行官扎克·拉什看來(lái),機(jī)器人(缺乏)批判思維能力是問(wèn)題所在。

          “機(jī)器人沒(méi)有人類的直覺(jué),人類可以理解‘我的GPS不太管用,但大概知道該往哪走’,”拉什告訴《財(cái)富》,“我們需要機(jī)器人也有那種直覺(jué)。”

          “進(jìn)入高樓林立的密集區(qū)域時(shí),視覺(jué)定位系統(tǒng)解決方案作用就非常大,”拉什說(shuō),“那種環(huán)境下,GPS和現(xiàn)有解決方案可能會(huì)失效。”

          他指出,配送的最后一刻會(huì)直接影響顧客體驗(yàn):“如果機(jī)器人在錯(cuò)的地方傻等,顧客體驗(yàn)會(huì)很糟糕。”

          “與Niantic Spatial合作還處于早期階段,但能跟如此優(yōu)秀的團(tuán)隊(duì)協(xié)作,探索如何將技術(shù)融入現(xiàn)有技術(shù)以提升服務(wù)質(zhì)量,我們都覺(jué)得很興奮。視覺(jué)定位系統(tǒng)顯然是理想選擇,”拉什繼續(xù)說(shuō)道,“他們能力非常強(qiáng)。如果送餐時(shí)定位更準(zhǔn)確,就能讓顧客滿意。”(財(cái)富中文網(wǎng))

          譯者:梁宇

          審校:夏林

          Pikachus at every street corner. Leveling up before getting into the gym. “Pokémon Go to the polls.” You remember this era well: Pokémon Go became a frenzy, with hundreds of millions taking to the streets for their chance to snap up the rare Azelf or special edition Charizard. Now, not only does it seem that Pokémon Go took the world by storm, but it also was using crowdsourced data to map it.

          Over the past decade, Pokémon Go players voluntarily submitted photos and short videos of public landmarks, street corners, storefronts, and urban intersections—all coming together to create a dataset that now stands at 30 billion images captured at ground level, across nearly every major city on the planet. Niantic Spatial, the enterprise AI and mapping division spun from Niantic Inc., has spent years converting that trove into something the robotics industry has never seen before: a photorealistic, street-level, continuously updated model of the physical world, built specifically for robots.

          That model is now being deployed to navigate Coco Robotics’ roughly 1,000 delivery bot fleet operating in cities across the country and around the world, including Los Angeles, Chicago, Miami, Jersey City, and Helsinki, logging millions of miles of deliveries to date. Brian McClendon, Niantic Spatial’s chief technology officer and one of the original creators of Google Earth, explains the data strategy plainly.

          “We look at the player data as very high-quality ground training data for other lower-quality datasets,” McClendon told Fortune in a statement. “The long-term philosophy of Niantic Spatial is that we can solve these hard problems of localization, reconstruction, and semantics by using very concentrated places to train models and then use much more broadly available data at lower resolution to be able to localize, visualize, and understand from ‘bad’ data.”

          The 30 billion Pokémon Go images aren’t just a map: They are a master key that unlocks the potential of how to create a real-world, real-time map. The player scans teach the model what precision looks like—it’s so precise, in fact, that it can even signal when the input is imperfect. It’s a strategy that positions Niantic Spatial less as a gaming company that pivoted and more as the most ambitious mapping operation ever assembled—one that was funded entirely by its own users’ enthusiasm for catching digital creatures.

          Niantic Spatial’s Visual Positioning System, or VPS, solves a problem that has quietly stunted the autonomous delivery industry. GPS, the backbone of most navigation systems, doesn’t fare that well in dense urban environments, where tall buildings interfere with satellite signals. For a delivery robot that needs to drop food at a precise doorstep, being several feet off means unhappy customers complaining their burger is cold—or in their neighbor’s tummy. Instead, the VPS bypasses satellites entirely, comparing live camera feeds from the robot against its vast image database to determine position in real time.

          “The model will work in real time, taking in images from the robot and comparing them to both publicly available as well as proprietary datasets we’ve collected to determine the robot’s global position and heading,” a Niantic Spatial spokesperson told Fortune in a statement. The company knew where this tech performs best: “Niantic Spatial’s VPS is particularly resilient in urban canyons where GPS performs badly.”

          “Our initial VPS was built using scans that users choose to take in games—but no single source defines the model,” the Niantic Spatial spokesperson said. Player participation was always opt-in: users had to actively choose to submit a short video scan of a specific public landmark. Today, the model increasingly learns from the data Niantic Spatial’s enterprise customers generate themselves. The underlying engine—a large geospatial model, or LGM, trained on billions of posed images and hundreds of millions of real-world scans—powers three capabilities: reconstructing spaces as navigable 3D models, localizing machines within those spaces, and understanding environments semantically. As CEO John Hanke wrote in a recent blog post: “For the past several years, we’ve been building a large geospatial model that acts as a living, breathing map of the world, one that is native to robots and AI.”

          For Coco CEO Zach Rash, the problem is with robots’ critical thinking skills (or lack thereof).

          “Robots don’t have the same intuition yet as a human, where a human can understand, ‘My GPS isn’t really working, but I understand that’s probably the right place to go,'” Rash told Fortune. “We need the robot to have that sort of intuition.”

          “When we go into really dense areas with high rises, that’s where the VPS solution can be really helpful,” Rash said. “Our GPS and our existing solutions might fail in that sort of environment.”

          The stakes, he noted, are felt by customers at the very last moment of a delivery: “It is a terrible customer experience if the robot parks in the wrong place waiting to receive that order.”

          “It’s very early with [Niantic Spatial], and I think we’re excited to collaborate with such an incredible team on figuring out how we add this toward existing technology to make the service better. VPS is an obvious one,” Rash continued. “They’re very good at doing this. If I can more precisely figure out where to drop off food, my customers will be happy.”

          財(cái)富中文網(wǎng)所刊載內(nèi)容之知識(shí)產(chǎn)權(quán)為財(cái)富媒體知識(shí)產(chǎn)權(quán)有限公司及/或相關(guān)權(quán)利人專屬所有或持有。未經(jīng)許可,禁止進(jìn)行轉(zhuǎn)載、摘編、復(fù)制及建立鏡像等任何使用。
          0條Plus
          精彩評(píng)論
          評(píng)論

          撰寫或查看更多評(píng)論

          請(qǐng)打開(kāi)財(cái)富Plus APP

          前往打開(kāi)