Skip to content

AI

Tensorflow.js POC 13: Avatar Generator with Face-API.js

Overview

QuickPOC
Figure 1 Computer Vision in AI

Avatar Generator with Face-API.js

This POC, a consequential POC of face-api.js, regonize the face from camera and find the nearest avatar from thousands avatars generated from avatar generators.

References

U2Net

SOD

SOD (Salient Object Detection) is a topics in deep learning that by given a image, SOD can automatically segmentize the most interested objects of the image without any hints. SOD learns how human see the interested objects by detecting the denisity of feature points and segmentize the most dense parts. So far, U2Net provide a state of art performance.

First results of U2Net

These are the first results of the U2Net on target benchmark images. For the full results can be checked in Chimay-SOD1 and asubset Chimay-SOD2 can be found.

{% include ideal-image-slider/slider.html selector="slider1" %}

Image sliders

In this page, image slider for jekyll and its js code is used for image slider. Also a Jekyll Ideal Image Slider Include Demo shows the possiblity of Ideal Image Slider.

References

Recommendation

Overview

DataForecastinAllVision
Figure 1 DataForecast in AI

pinreset and its pin alogirthm

Amplitude Based Recommendation

amplitude user cohort lists

Here gives a demo for amplitude cohort download and query JSON-Server for Amplitude User Cohorts

References

Pytorch POC 2: OpenTTS

Git Repo Status Progress Comments
OpenTTS status progress Pytorch POC #2
mozillatts status progress Pytorch POC #3
MaryTTS status progress Pytorch POC #4

Based on last time keyword spotting topics on Chimay, I even mention items about TTS (text-to-speech) and showed POCs. Here I adopt Opentts to create a API server for speech and later ultrasound generation from Web.

Opentts

In live opentts demo site, you can check the conventional (non-deep learning) speech synthesis (marytts, nanotts) and deep-learning ones (Mozillatts with Tacotron and Tacotron2). Deep-learing ones provide a beeter speech quality. A public MOS test results as below also show similar conclusions.

MOS

Demo wave file as

Demo wave

Swagger API also includes the following:

Opentts swagger

The following diagram is from mozzila project. It shows the whole picture of nature lanaugege iteration with end users. But, of course, it will be a long way to go.

References

Dataset

Datasets

Datasets for Faces

Datasets for Images

Datasets for Images on Salient Object Detection
  • ECSSD
  • CSSD
  • DUTS-TE
  • DUTS-TR
  • salObj
  • DUT-OMRON
  • chimay-SOD1- chimay SOD testset #1

    • Download from wget
    wget -r -c -nH -np --cut-dirs=1 --content-disposition --no-check-certificate -U "Mozilla/5.0 (Android) Nextcloud-android/3.8.0" -O output.zip  http://dlc.barco.com:9980/s/m6dwLSan8M97YgW/download
    

Datasets for Audio

Datasets for WIFI locations

Datasets for movie rating

Datasets for animations

Datasets for NLP

References