See how ML Kit and ARKit play together

Category: iOS · Published: 6 years ago


Source: See how ML Kit and ARKit play together from Firebase

We at the Firebase office all enjoyed playing with Hanley Weng’s “CoreML-in-ARKit” project. It displays 3D labels on top of images it detects in the scene. While on-device detection responds quickly, we wanted a solution that combines the speed of the on-device model with the accuracy you can get from a cloud-based one. That’s exactly what we built with our MLKit-ARKit project. Read on to find out how we did it!


How it all works

ML Kit for Firebase is a mobile SDK that extends Google Cloud’s machine learning (ML) expertise into Android and iOS apps in a powerful yet easy-to-use package. It includes easy-to-use Base APIs and also offers the ability to bring your own custom TFLite models.

ARKit is Apple’s framework that combines device motion tracking, camera scene capture, advanced scene processing, and display conveniences to simplify the task of building an AR experience. You can use these technologies to create many kinds of AR experiences using either the back camera or front camera of an iOS device.

In this project we push ARKit frames from the back camera into a queue. ML Kit processes each queued frame to label the objects it contains.

When the user taps the screen, ML Kit returns the detected label with the highest confidence. We then create a 3D text bubble and add it to the user’s scene.
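The "highest confidence on tap" step can be sketched in a few lines of Swift. This is a minimal illustration, not the project's actual code: `DetectedLabel` and `LatestLabels` are hypothetical stand-ins for the SDK's result type and for whatever the app uses to remember the labels from the most recently processed frame.

```swift
// Hypothetical stand-in for an ML Kit label result; not the SDK's own type.
struct DetectedLabel: Equatable {
    let text: String
    let confidence: Float  // assumed to lie between 0.0 and 1.0
}

// Keeps only the labels from the most recently processed frame.
struct LatestLabels {
    private(set) var labels: [DetectedLabel] = []

    // Called each time a frame finishes processing.
    mutating func update(with newLabels: [DetectedLabel]) {
        labels = newLabels
    }

    // Called on tap: the label with the highest confidence, or nil if none.
    func topLabel() -> DetectedLabel? {
        return labels.max { $0.confidence < $1.confidence }
    }
}

// Usage with made-up results for one frame.
var latest = LatestLabels()
latest.update(with: [
    DetectedLabel(text: "Cup", confidence: 0.62),
    DetectedLabel(text: "Coffee", confidence: 0.91),
])
let tapped = latest.topLabel()  // the "Coffee" label
```

Returning an optional keeps the tap handler honest about the case where no frame has produced a label yet.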

How ML Kit works

ML Kit makes ML easy for all mobile developers, whether you have experience in ML or are new to the space. For those with more advanced use cases, ML Kit allows you to bring your own TFLite models, but for more common use cases, you can implement one of the easy-to-use Base APIs. These APIs cover use cases such as text recognition, image labeling, face detection and more, and are backed by models trained by Google Cloud. We’ll be using image labeling in our example.

Base APIs are available in two flavors: on-device and cloud-based. The on-device APIs are free to use and run locally, while the cloud-based ones provide higher accuracy and more precise responses, backed by the full-sized models from Google’s Cloud Vision APIs. The cloud-based Vision APIs are free for the first 1,000 calls per API each month and paid after that.

Hybrid approach

We use the ML Kit on-device image labeling API to get a live feed of results while keeping our frame rate steady at 60 fps. When the user taps the screen, we fire off an async call to the Cloud image labeling API with the current image. When the response from this higher-accuracy model arrives, we update the 3D label on the fly. So while the on-device API runs continuously and supplies the initial label, the higher-accuracy Cloud API is called on demand, and its result eventually replaces the on-device label.

Which result to show?

While the on-device API is real-time with all the processing happening locally, the Cloud Vision API makes a network request to the Google Cloud backend, leveraging a larger, higher accuracy model. Given that we consider this the more precise response, in our app we replace the label provided by the on-device API with the result from Cloud Vision API when it arrives.
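The replacement rule described above can be sketched as follows. This is an illustration under stated assumptions rather than the sample app's code: `LabelSource`, `BubbleLabel`, and `handleTap` are hypothetical names, and `requestCloudLabel` is a placeholder closure standing in for the asynchronous Cloud API call (it passes `nil` to its completion when the request fails).

```swift
// Where a label came from. All types here are hypothetical, for illustration.
enum LabelSource {
    case onDevice
    case cloud
}

// The text shown in one 3D bubble, plus the source that produced it.
struct BubbleLabel {
    private(set) var text: String
    private(set) var source: LabelSource

    init(onDeviceText: String) {
        text = onDeviceText
        source = .onDevice
    }

    // A cloud result always wins; an on-device result never overwrites a
    // cloud result that has already arrived. Returns true if it was applied.
    @discardableResult
    mutating func apply(_ newText: String, from newSource: LabelSource) -> Bool {
        if source == .cloud && newSource == .onDevice {
            return false
        }
        text = newText
        source = newSource
        return true
    }
}

// Tap handling: show the on-device label at once, then ask for the cloud label
// and re-render when (and if) it arrives.
func handleTap(
    onDeviceText: String,
    requestCloudLabel: (@escaping (String?) -> Void) -> Void,
    render: @escaping (BubbleLabel) -> Void
) {
    var bubble = BubbleLabel(onDeviceText: onDeviceText)
    render(bubble)
    requestCloudLabel { cloudText in
        guard let cloudText = cloudText else { return }  // keep on-device label
        bubble.apply(cloudText, from: .cloud)
        render(bubble)
    }
}

// Usage with a stub that answers immediately instead of hitting the network.
var shown: [String] = []
handleTap(
    onDeviceText: "Cup",
    requestCloudLabel: { completion in completion("Espresso cup") },
    render: { shown.append($0.text) }
)
// shown is now ["Cup", "Espresso cup"]
```

Recording the source alongside the text is one simple way to keep a late on-device result from undoing the cloud label; the real app may track this differently.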

Try it yourself!

1. Clone the project

$ git clone https://github.com/FirebaseExtended/MLKit-ARKit.git

2. Install the pods and open the .xcworkspace file to see the project in Xcode.

$ cd MLKit-ARKit
$ pod install --repo-update
$ open MLKit-ARKit.xcworkspace

3. To set up the Firebase ML Kit in the sample app:

Follow the instructions for adding Firebase to your app.
Make sure to specify “com.google.firebaseextended.MLKit-ARKit” as the iOS project bundle ID.
Download the GoogleService-Info.plist file generated as part of adding Firebase to your app.
In Xcode, add the GoogleService-Info.plist file to your app, next to Info.plist.

At this point, the app should work using the on-device recognition.
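If the app fails to start at this stage, a mismatched bundle ID is one thing worth ruling out. The sketch below is an optional sanity check, not part of the sample project: it assumes the downloaded file carries a `BUNDLE_ID` entry (as Firebase-generated files typically do) and that the file sits in the current directory; `bundleID(fromPlistData:)` is a hypothetical helper.

```swift
import Foundation

// The bundle ID this article asks you to register in the Firebase console.
let expectedBundleID = "com.google.firebaseextended.MLKit-ARKit"

// Reads the BUNDLE_ID entry from GoogleService-Info.plist data.
// Returns nil if the data is not a property-list dictionary or lacks the key.
func bundleID(fromPlistData data: Data) -> String? {
    guard
        let plist = try? PropertyListSerialization.propertyList(
            from: data, options: [], format: nil),
        let dict = plist as? [String: Any]
    else { return nil }
    return dict["BUNDLE_ID"] as? String
}

// Assumed location: adjust the path to wherever you saved the file.
let plistURL = URL(fileURLWithPath: "GoogleService-Info.plist")
if let data = try? Data(contentsOf: plistURL) {
    let found = bundleID(fromPlistData: data)
    if found == expectedBundleID {
        print("Bundle ID matches")
    } else {
        print("Bundle ID mismatch: \(found ?? "missing")")
    }
} else {
    print("GoogleService-Info.plist not found at \(plistURL.path)")
}
```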

4. (Optional) To set up Cloud Vision API in the sample app:

Switch your Firebase project to the Blaze plan

Only Blaze-level projects can use the Cloud Vision APIs. Follow these steps to switch your project to the Blaze plan and enable pay-as-you-go billing.
1. Open your project in the Firebase console.
2. Click on the MODIFY link in the lower left corner next to the currently selected Spark plan.
3. Select the Blaze plan and follow the instructions in the Firebase console to add a billing account.
  ★ The cloud label detection feature is still free for the first 1,000 uses per month. See the Firebase pricing page for additional details.

Go to the ML Kit section of the Firebase console and enable the “Cloud Based APIs” toggle at the top.

At this point, the app should update labels with more precise results from the Cloud Vision API.

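Except as otherwise noted, the content of this article is licensed under the Creative Commons Attribution 3.0 License, and code samples are licensed under the Apache 2.0 License. For details, see our terms of service.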
