Virtual Background in webcam with Body Segmentation technique


Implementation

Did you take some selfies above and show them to your friends? I hope you like it. This app utilizes an advanced technique called Body Segmentation, which can identify human beings in an image or video stream and segment the foreground body from the background.

Early this year, Google released BodyPix, an open-source machine learning model which allows for person and body-part segmentation in the browser with TensorFlow.js. I was amazed by this technology and came up with the idea of building the above Selfie Anywhere application. Follow me below on the journey of how I implemented it.

# Step 1 : Include tfjs and body-pix

First of all, simply include the TensorFlow.js script and its body-pix model in the <head> section of the HTML file.

<script src="https://cdn.jsdelivr.net/npm/@tensorflow/tfjs@1.2"></script>
<script src="https://cdn.jsdelivr.net/npm/@tensorflow-models/body-pix@2.0"></script>

Or you can install it via npm for use in a TypeScript / ES6 project

npm install @tensorflow-models/body-pix
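If you go the npm route, the packages are imported as ES modules. A minimal sketch (assuming a bundler resolves the packages and that @tensorflow/tfjs is also installed) looks like this:

// Import TensorFlow.js and the BodyPix model as ES modules
import * as tf from '@tensorflow/tfjs';
import * as bodyPix from '@tensorflow-models/body-pix';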

# Step 2 : Stream webcam to browser

To stream your webcam into the browser, I utilize the browser API navigator.mediaDevices.getUserMedia. To find out more details about it, please refer to my previous article.
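As a quick recap, a minimal sketch of piping the camera into the <video> element used in this demo might look like the following (the element id webcam matches the markup in Step 6; the resolution constraints are just illustrative):

const webcamElement = document.getElementById('webcam');

async function startWebcam() {
  // Request a 640x480 video stream from the default camera
  const stream = await navigator.mediaDevices.getUserMedia({
    audio: false,
    video: { width: 640, height: 480 }
  });
  // Attach the stream to the <video> element and start playback
  webcamElement.srcObject = stream;
  return new Promise(resolve => {
    webcamElement.onloadedmetadata = () => {
      webcamElement.play();
      resolve();
    };
  });
}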

# Step 3 : Load BodyPix Model

In order to perform segmentation, we first need to load the pre-trained BodyPix model by calling the API bodyPix.load(modelConfig). BodyPix comes with a few different versions of the model, with different performance characteristics trading off model size and prediction time against accuracy.

By default, BodyPix loads a MobileNetV1 architecture with a 0.75 multiplier. This is recommended for computers with mid-range/lower-end GPUs. A model with a 0.50 multiplier is recommended for mobile. The ResNet architecture is recommended for computers with even more powerful GPUs.

// Load the model once (inside an async function) and keep the returned net for later use
const net = await bodyPix.load({
  architecture: 'MobileNetV1',
  outputStride: 16,
  multiplier: 0.75,
  quantBytes: 2
});
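If you are on a machine with a powerful GPU, the ResNet variant mentioned above can be loaded instead. A sketch of such a configuration (the outputStride and quantBytes values here are just a reasonable starting point) might look like:

// Heavier but more accurate model for powerful desktop GPUs
const net = await bodyPix.load({
  architecture: 'ResNet50',
  outputStride: 32,
  multiplier: 1.0,
  quantBytes: 2
});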

# Step 4 : Body segmentation

Next, we start to feed the webcam stream through the BodyPix model to perform person segmentation, by calling the API net.segmentPerson(image, config). It segments an image into pixels that are and aren’t part of a person. It returns a binary array with 1 for the pixels that are part of the person, and 0 otherwise. The array size corresponds to the number of pixels in the image.

function detectBody() {
  net.segmentPerson(webcamElement, {
    flipHorizontal: true,
    internalResolution: 'medium',
    segmentationThreshold: 0.5
  })
  .then(personSegmentation => {
    // Only draw when a segmentation result came back
    if (personSegmentation != null) {
      drawBody(personSegmentation);
    }
  });
  // Schedule processing of the next webcam frame
  cameraFrame = requestAnimFrame(detectBody);
}

flipHorizontal defaults to false and controls whether the segmentation & pose should be flipped/mirrored horizontally. Set it to true for video sources that are flipped horizontally by default (i.e. a webcam) when you want the segmentation & pose returned in the proper orientation.

segmentationThreshold determines the minimum score a pixel must have to be considered part of a person. In essence, a higher value creates a tighter crop around a person, but may result in some pixels that are part of a person being excluded from the returned segmentation mask.

It returns a Promise that resolves with a SemanticPersonSegmentation object. Multiple people in the image get merged into a single binary mask. In addition to the width, height, and data fields, it returns an allPoses field which contains the poses of all people. The data array contains 307,200 values, one for each pixel of the 640×480 image.

{
 width: 640,
 height: 480,
 data: Uint8Array(307200) [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 0, 0, 1, …],
 allPoses: [{"score": 0.4, "keypoints": […]}, …]
}
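As a quick sanity check, you can inspect the mask before drawing anything, for example by counting how many pixels were classified as part of a person (personSegmentation being the object resolved above):

// Count the pixels marked as "person" (value 1) in the binary mask
const personPixels = personSegmentation.data.reduce((sum, v) => sum + v, 0);
const coverage = (100 * personPixels / personSegmentation.data.length).toFixed(1);
console.log(`Person covers ${coverage}% of the frame`);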

# Step 5 : Remove Background

In the above function we get the binary array indicating whether each pixel belongs to the body or not; now we can use it to remove the background and draw only the body on a canvas. In an ImageData object, each pixel holds values for red, green, blue, and alpha (transparency), so the trick to removing the background is to set a pixel’s alpha value to 0.

const canvasPerson = document.getElementById("canvasPerson");
let contextPerson = canvasPerson.getContext('2d');

function drawBody(personSegmentation) {
  // Draw the current webcam frame onto the canvas
  contextPerson.drawImage(webcamElement, 0, 0, webcamElement.width, webcamElement.height);
  let imageData = contextPerson.getImageData(0, 0, webcamElement.width, webcamElement.height);
  let pixel = imageData.data;
  // Each pixel occupies 4 entries (R, G, B, A) in the ImageData array
  for (let p = 0; p < pixel.length; p += 4) {
    // Mask value 0 means background: make that pixel fully transparent
    if (personSegmentation.data[p / 4] === 0) {
      pixel[p + 3] = 0;
    }
  }
  contextPerson.imageSmoothingEnabled = true;
  contextPerson.putImageData(imageData, 0, 0);
}

# Step 6 : Overlay canvas above background image

Once we have the canvas that contains only the body with a transparent background, we just need to overlay it on top of a background image of a breathtaking nature scene.

<video id="webcam" autoplay playsinline width="640" height="480"></video>
<div id="selfie-container">
  <div id="background-container"></div>
  <canvas id="canvasPerson" width="640" height="480"></canvas>
</div>

Apply the CSS styles below:

#background-container {
  height: 100vh;
  width: 100vw;
  background-image: url(../images/greatwall.jpg);
  background-position: center center;
  background-repeat: no-repeat;
  background-size: cover;
  background-color: transparent;
}

#canvasPerson {
  background-color: transparent;
  position: absolute;
  width: 100vw;
  height: auto;
  z-index: 9999;
  top: 0;
  bottom: 0;
  left: 0;
  right: 0;
  margin: auto;
  -moz-transform: scale(-1, 1);
  -webkit-transform: scale(-1, 1);
  -o-transform: scale(-1, 1);
  transform: scale(-1, 1);
  filter: FlipH;
}

# Step 7 : Take screenshot

For taking the picture, I am using a 3rd-party JavaScript library, html2canvas.js. It allows you to take “screenshots” of web pages, or parts of them, directly in the user’s browser.

$("#take-photo").click(function () {
beforeTakePhoto();
var captureElement= document.getElementById('selfie-container');
var appendElement= document.getElementById('webcam-container');
html2canvas(captureElement).then(function(canvas) {
canvas.id='captureCanvas';
appendElement.appendChild(canvas);
document.querySelector('#download-photo').href = canvas.toDataURL('image/png');
afterTakePhoto();
});
});

That’s pretty much it for the code! The rest is just making the demo look nice. Choose one of those spectacular scenes, strike your favorite pose and smile!

