I am 馮 志聖 (Mike) from the Research and Development Office.
First, I will talk about removing image backgrounds on iOS using semantic segmentation.
I will then extend this into AR Cut & Paste.
Second, I will talk about real-time text recognition combined with a document classifier on iOS.
AR Cut & Paste Introduction
Sometimes we walk down the street and see something cool, like a flower or an animal.
We want to cut out just the subject and leave the background behind.
The usual workflow is: take a photo, then remove the background with an image-processing tool.
That takes a long time.
So I wanted to make it smarter.
I combined semantic segmentation using the DeepLab model with ARKit.
With this, you just take a photo and paste the cutout wherever you like.
This is semantic segmentation with the DeepLab model.
https://ai.googleblog.com/2018/03/semantic-image-segmentation-with.html
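As a rough illustration of the background-removal step, the sketch below takes the per-pixel class labels that a segmentation model produces and makes every background pixel transparent. It is only a minimal sketch, not the app's actual code: the function name `remove_background` and the tiny 2x2 image are hypothetical, and it assumes label 0 means "background" (as in the PASCAL VOC label set that public DeepLab models are commonly trained on).

```python
# Assumption: label 0 is "background"; any other label is treated as foreground.
BACKGROUND = 0


def remove_background(pixels, labels, background=BACKGROUND):
    """Return RGBA pixels in which background pixels are fully transparent.

    pixels: rows of (r, g, b) tuples
    labels: rows of integer class labels, same shape as pixels
    """
    if len(pixels) != len(labels):
        raise ValueError("pixels and labels must have the same height")
    result = []
    for pixel_row, label_row in zip(pixels, labels):
        if len(pixel_row) != len(label_row):
            raise ValueError("pixels and labels must have the same width")
        result.append([
            # Alpha 0 hides the pixel, alpha 255 keeps it fully visible.
            (r, g, b, 0 if label == background else 255)
            for (r, g, b), label in zip(pixel_row, label_row)
        ])
    return result


# Hypothetical 2x2 image and label map; 15 stands in for any foreground class.
image = [[(10, 20, 30), (200, 10, 10)],
         [(0, 0, 0), (255, 255, 255)]]
mask = [[0, 15],
        [15, 0]]
cutout = remove_background(image, mask)
```

The resulting RGBA cutout is what could then be placed on a plane in the AR scene.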
AR Cut & Paste
This is the structure of the application.
AR Cut & Paste Demo
Here is a simple input image and the resulting output image.
All images are from Pixabay.
Image by Free-Photos from Pixabay
AR Cut & Paste Final
It is a cool application for creating AR photos.
This is a demo of AR Cut & Paste.
Real-time Text Recognition with Document Classifier Introduction
Sometimes we need to classify many documents or posters.
Doing this by hand takes a long time.
So I tried to build real-time text recognition with a document classifier.
It helps us classify documents in a smarter way.
You just point the mobile camera at the content.
The application classifies the content automatically.
Real-time Text Recognition with Document Classifier
This is the structure of the application.
The application has two main parts.
The first is Google ML Kit Text Recognition.
https://developers.google.com/vision/android/text-overview
This is a flow chart of an OCR system.
https://www.researchgate.net/figure/Flow-Chart-for-OCR-system_fig4_276108387
The second is the document classifier.
This is a flow chart of document clustering using the k-means algorithm.
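To make the clustering idea concrete, here is a minimal sketch of k-means over bag-of-words vectors. The post does not describe the features or the initialization it uses, so everything here is an assumption for illustration: the function names, the six short sample texts, the normalized word-count vectors, and the choice to seed the centroids with one document per topic (which keeps the run deterministic and lets each cluster index be read as a topic).

```python
import math
from collections import Counter


def vectorize(text, vocabulary):
    """Turn text into a unit-length word-count vector over the vocabulary."""
    counts = Counter(text.lower().split())
    vec = [counts[word] for word in vocabulary]
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0  # avoid dividing by zero
    return [v / norm for v in vec]


def squared_distance(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest(vector, centroids):
    """Index of the centroid closest to the vector."""
    return min(range(len(centroids)),
               key=lambda k: squared_distance(vector, centroids[k]))


def kmeans(vectors, initial_centroids, iterations=10):
    """Plain k-means: alternate the assignment step and the centroid update."""
    centroids = [list(c) for c in initial_centroids]
    assignments = []
    for _ in range(iterations):
        assignments = [nearest(v, centroids) for v in vectors]
        for k in range(len(centroids)):
            members = [v for v, a in zip(vectors, assignments) if a == k]
            if members:  # keep the old centroid if a cluster is empty
                centroids[k] = [sum(col) / len(members) for col in zip(*members)]
    return assignments, centroids


# Hypothetical sample texts, two per topic.
docs = [
    "wireless network router wireless signal",
    "bank account interest deposit bank",
    "baseball pitcher inning bat baseball",
    "router network wireless access",
    "deposit interest account savings",
    "pitcher bat inning team",
]
topics = ["Technology", "Business", "Sports"]  # matches the seed order below

vocabulary = sorted({word for doc in docs for word in doc.split()})
vectors = [vectorize(doc, vocabulary) for doc in docs]

# Assumption: seed with the first three documents, one per topic.
assignments, centroids = kmeans(vectors, vectors[:3])


def classify(text):
    """Assign newly recognized text to the nearest cluster centroid."""
    return nearest(vectorize(text, vocabulary), centroids)


print([topics[a] for a in assignments])
print(topics[classify("wireless signal from the router")])
```

In the real-time setting, the text returned by the recognizer for each camera frame would play the role of the string passed to `classify`. Words outside the vocabulary are simply ignored by this sketch.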
Real-time Text Recognition with Document Classifier Demo
I used three kinds of content from Wikipedia for testing.
1. Wi-Fi = Technology
2. Savings account = Business
3. Baseball = Sports
Real-time Text Recognition with Document Classifier Final
Google ML Kit Text Recognition runs very fast on iOS.
The document classifier is fast too.
As a result, the application can run in real time.
It is a convenient way to find out what kind of content we are looking at.
Other
Tell us what you think about our results, or anything else that comes to mind.
We welcome all of your comments and suggestions.