Kalidokit is a blendshape and kinematics solver for Mediapipe/Tensorflow.js face, eyes, pose, and hand tracking models

Rich

Last update: Jan 7, 2023

Related tags

Overview

KalidoKit - Face, Pose, and Hand Tracking Kinematics

Kalidokit is a blendshape and kinematics solver for Mediapipe/Tensorflow.js face, eyes, pose, and hand tracking models, compatible with Facemesh, Blazepose, Handpose, and Holistic. It takes predicted 3D landmarks and calculates simple euler rotations and blendshape face values.

As the core to Vtuber web apps, Kalidoface and Kalidoface 3D, KalidoKit is designed specifically for rigging 3D VRM models and Live2D avatars!

Install

Via NPM

npm install kalidokit

import * as Kalidokit from "kalidokit";

// or only import the class you need

import { Face, Pose, Hand } from "kalidokit";

Via CDN

<script src="https://cdn.jsdelivr.net/npm/[email protected]/dist/kalidokit.umd.js"></script>

Methods

Kalidokit is composed of 3 classes for Face, Pose, and Hand calculations. They accept landmark outputs from models like Facemesh, Blazepose, Handpose, and Holistic.

// Accepts an array(468 or 478 with iris tracking) of vectors
Kalidokit.Face.solve(facelandmarkArray, {
    runtime: "tfjs", // `mediapipe` or `tfjs`
    video: HTMLVideoElement,
    imageSize: { height: 0, width: 0 },
    smoothBlink: false, // smooth left and right eye blink delays
    blinkSettings: [0.25, 0.75], // adjust upper and lower bound blink sensitivity
});

// Accepts arrays(33) of Pose keypoints and 3D Pose keypoints
Kalidokit.Pose.solve(poseWorld3DArray, poseLandmarkArray, {
    runtime: "tfjs", // `mediapipe` or `tfjs`
    video: HTMLVideoElement,
    imageSize: { height: 0, width: 0 },
    enableLegs: true,
});

// Accepts array(21) of hand landmark vectors; specify 'Right' or 'Left' side
Kalidokit.Hand.solve(handLandmarkArray, "Right");

// Using exported classes directly
Face.solve(facelandmarkArray);
Pose.solve(poseWorld3DArray, poseLandmarkArray);
Hand.solve(handLandmarkArray, "Right");

Additional Utils

// Stabilizes left/right blink delays + wink by providing blenshapes and head rotation
Kalidokit.Face.stabilizeBlink(
    { r: 0, l: 1 }, // left and right eye blendshape values
    headRotationY, // head rotation in radians
    {
        noWink = false, // disables winking
        maxRot = 0.5 // max head rotation in radians before interpolating obscured eyes
    });

// The internal vector math class
Kalidokit.Vector();

Remixable VRM Template with KalidoKit

Quick-start your Vtuber app with this simple remixable example on Glitch. Face, full-body, and hand tracking in under 350 lines of javascript. This demo uses Mediapipe Holistic for body tracking, Three.js + Three-VRM for rendering models, and KalidoKit for the kinematic calculations. This demo uses a minimal amount of easing to smooth animations, but feel free to make it your own!

Basic Usage

The implementation may vary depending on what pose and face detection model you choose to use, but the principle is still the same. This example uses Mediapipe Holistic which concisely combines them together.

{ await holistic.send({image: HTMLVideoElement}); }, width: 640, height: 480 }); camera.start(); ">

import * as Kalidokit from 'kalidokit'
import '@mediapipe/holistic/holistic';
import '@mediapipe/camera_utils/camera_utils';

let holistic = new Holistic({locateFile: (file) => {
    return `https://cdn.jsdelivr.net/npm/@mediapipe/[email protected]/${file}`;
}});

holistic.onResults(results=>{
    // do something with prediction results
    // landmark names may change depending on TFJS/Mediapipe model version
    let facelm = results.faceLandmarks;
    let poselm = results.poseLandmarks;
    let poselm3D = results.ea;
    let rightHandlm = results.rightHandLandmarks;
    let leftHandlm = results.leftHandLandmarks;

    let faceRig = Kalidokit.Face.solve(facelm,{runtime:'mediapipe',video:HTMLVideoElement})
    let poseRig = Kalidokit.Pose.solve(poselm3d,poselm,{runtime:'mediapipe',video:HTMLVideoElement})
    let rightHandRig = Kalidokit.Hand.solve(rightHandlm,"Right")
    let leftHandRig = Kalidokit.Hand.solve(leftHandlm,"Left")

    };
});

// use Mediapipe's webcam utils to send video to holistic every frame
const camera = new Camera(HTMLVideoElement, {
  onFrame: async () => {
    await holistic.send({image: HTMLVideoElement});
  },
  width: 640,
  height: 480
});
camera.start();

Slight differences with Mediapipe and Tensorflow.js

Due to slight differences in the results from Mediapipe and Tensorflow.js, it is recommended to specify which runtime version you are using as well as the video input/image size as a reference.

Kalidokit.Pose.solve(poselm3D,poselm,{
    runtime:'tfjs', // default is 'mediapipe'
    video: HTMLVideoElement,// specify an html video or manually set image size
    imageSize:{
        width: 640,
        height: 480,
    };
})

Kalidokit.Face.solve(facelm,{
    runtime:'mediapipe', // default is 'tfjs'
    video: HTMLVideoElement,// specify an html video or manually set image size
    imageSize:{
        width: 640,
        height: 480,
    };
})

Outputs

Below are the expected results from KalidoKit solvers.

// Kalidokit.Face.solve()
// Head rotations in radians
// Degrees and normalized rotations also available
{
    eye: {l: 1,r: 1},
    mouth: {
        x: 0,
        y: 0,
        shape: {A:0, E:0, I:0, O:0, U:0}
    },
    head: {
        x: 0,
        y: 0,
        z: 0,
        width: 0.3,
        height: 0.6,
        position: {x: 0.5, y: 0.5, z: 0}
    },
    brow: 0,
    pupil: {x: 0, y: 0}
}

// Kalidokit.Pose.solve()
// Joint rotations in radians, leg calculators are a WIP
{
    RightUpperArm: {x: 0, y: 0, z: -1.25},
    LeftUpperArm: {x: 0, y: 0, z: 1.25},
    RightLowerArm: {x: 0, y: 0, z: 0},
    LeftLowerArm: {x: 0, y: 0, z: 0},
    LeftUpperLeg: {x: 0, y: 0, z: 0},
    RightUpperLeg: {x: 0, y: 0, z: 0},
    RightLowerLeg: {x: 0, y: 0, z: 0},
    LeftLowerLeg: {x: 0, y: 0, z: 0},
    LeftHand: {x: 0, y: 0, z: 0},
    RightHand: {x: 0, y: 0, z: 0},
    Spine: {x: 0, y: 0, z: 0},
    Hips: {
        worldPosition: {x: 0, y: 0, z: 0},
        position: {x: 0, y: 0, z: 0},
        rotation: {x: 0, y: 0, z: 0},
    }
}

// Kalidokit.Hand.solve()
// Joint rotations in radians
// only wrist and thumb have 3 degrees of freedom
// all other finger joints move in the Z axis only
{
    RightWrist: {x: -0.13, y: -0.07, z: -1.04},
    RightRingProximal: {x: 0, y: 0, z: -0.13},
    RightRingIntermediate: {x: 0, y: 0, z: -0.4},
    RightRingDistal: {x: 0, y: 0, z: -0.04},
    RightIndexProximal: {x: 0, y: 0, z: -0.24},
    RightIndexIntermediate: {x: 0, y: 0, z: -0.25},
    RightIndexDistal: {x: 0, y: 0, z: -0.06},
    RightMiddleProximal: {x: 0, y: 0, z: -0.09},
    RightMiddleIntermediate: {x: 0, y: 0, z: -0.44},
    RightMiddleDistal: {x: 0, y: 0, z: -0.06},
    RightThumbProximal: {x: -0.23, y: -0.33, z: -0.12},
    RightThumbIntermediate: {x: -0.2, y: -0.19, z: -0.01},
    RightThumbDistal: {x: -0.2, y: 0.002, z: 0.15},
    RightLittleProximal: {x: 0, y: 0, z: -0.09},
    RightLittleIntermediate: {x: 0, y: 0, z: -0.22},
    RightLittleDistal: {x: 0, y: 0, z: -0.1}
}

Community Showcase

If you'd like to share a creative use of KalidoKit, we would love to hear about it! Feel free to also use our Twitter hashtag, #kalidokit.

Open to Contributions

The current library is a work in progress and contributions to improve it are very welcome. Our goal is to make character face and pose animation even more accessible to creatives regardless of skill level!

Comments

parse js files from project structure

Excuse me I have a question in the docs capertar in the script.js file can't export any functions or new functions within that script.js file, when I want to use in another js file, the question is why is it? can't you export those functions from that script.js file?

opened by ubilrodriguez 16
Improve leg and arm rotation
Improvement of Leg and Arm Rotation

Hi Rich, It took some time and I am not 100% satisfied with this solution, but I think it is an improvement. What has changed:

Types for left and right

Unit test for upper and lower legs

Alternative calculation for the leg rotation

Alternative calculation for the arm rotation

Introduction of a Euler Class to keep track of the rotation order (especially import for the arm rotations)

A little code cleaning

The use of constants for PI and TWO_PI

I'm looking forward to your feedback!
opened by utileetdulce 11
Calling the Solver from node.js

Hi, thanks for your awesome project!

I am wondering whether it's possible to calling the kalidokit face solver from a node.js service.

The story: a windows executable(exe) send an image to a nodejs web service, which runs mediapipe and kalidokit

Can you give me some suggestions?

Thanks!

opened by FishWoWater 10
Fix brow LM indices left/right

I don't know if this is 100% correct, but it seems the brow landmark point indices are mirrored, or at least do not match the pupil side, which messes up the pupils solve because the brows are the bounding box of the pupils.

I don't know if it matters but I'm using holistic latest model.

In my testing I found the pupils math is completely messed up because of this bug -- it is "corrected" by some additional averaging of both eyes, which "fixes" the error but adds a lot of jitter to the pupil tracking. https://github.com/yeemachine/kalidokit/blob/73f202663c32673f7e21c8869bf02f81223bc340/src/FaceSolver/calcEyes.js#L97

opened by avaer 6

Fix: 'Cannot find type definition file for 'node'.' when using yarn to install dependencies

Add yarn.lock file when using yarn to install dependencies and fix the error, showing "Cannot find type definition file for 'node'." message when yarn tsc -p . building. Error messages shown below:

> [email protected] build:lib
> tsc -p .

../../node_modules/@types/plist/index.d.ts:7:23 - error TS2688: Cannot find type definition file for 'node'.

7 /// <reference types="node" />
                        ~~~~

../../node_modules/@types/plist/index.d.ts:21:10 - error TS2580: Cannot find name 'Buffer'. Do you need to install type definitions for node? Try `npm i --save-dev @types/node`.

21   Date | Buffer | PlistObject | PlistArray;
            ~~~~~~

../../node_modules/@types/responselike/index.d.ts:6:23 - error TS2688: Cannot find type definition file for 'node'.

6 /// <reference types="node" />
                        ~~~~

../../node_modules/@types/responselike/index.d.ts:8:33 - error TS2307: Cannot find module 'http' or its corresponding type declarations.

8 import { IncomingMessage } from 'http';
                                  ~~~~~~

../../node_modules/@types/responselike/index.d.ts:9:24 - error TS2307: Cannot find module 'stream' or its corresponding type declarations.

9 import { Stream } from 'stream';
                         ~~~~~~~~

../../node_modules/@types/responselike/index.d.ts:19:11 - error TS2580: Cannot find name 'Buffer'. Do you need to install type definitions for node? Try `npm i --save-dev @types/node`.

19     body: Buffer;
             ~~~~~~

../../node_modules/@types/responselike/index.d.ts:31:15 - error TS2580: Cannot find name 'Buffer'. Do you need to install type definitions for node? Try `npm i --save-dev @types/node`.

31         body: Buffer,
                 ~~~~~~

../../node_modules/xmlbuilder/typings/index.d.ts:2:26 - error TS2307: Cannot find module 'stream' or its corresponding type declarations.

2 import { Writable } from 'stream';
                           ~~~~~~~~


Found 8 errors.

error Command failed with exit code 2.```

opened by LehaoLin 5

Failed to acquire camera feed

when i open the site (selfhosted) i get the error: Failed to acquire camera feed: AbortError: Starting videoinput failed I tested it with Firefox. In chrome i get the error: Failed to acquire camera feed: NotReadableError: Could not start video source

opened by Minionflo 4
[Feature Request] Release kalidokit with types

I install kalidokit by npm. And can not find types for it.

But i clone this repo and build it myself. I find inde.d.ts, etc. (by tsc)

Did you forget to re-release the new version?

opened by YunYouJun 4
extraneous dependencies
currently kalidokit package has following hard dependencies:

"dependencies": { "@mediapipe/camera_utils": "^0.3.1632432234", "@mediapipe/drawing_utils": "^0.3.1620248257", "@mediapipe/holistic": "^0.5.1635989137", "@pixiv/three-vrm": "^0.6.7", "three": "^0.133.0" },

which means they will all be installed when installing kalidokit which makes it at least 100x bigger than it should be

and in reality, all of those dependencies are needed only for demos and not needed at all for actual solver. please remove hard dependencies - you can make them optional instead

for example, i prefer to use my own implementation of ml processing (using different mediapipe models and using custom processing instead of mediapipe official modules) and using babylonjs instead of threejs
opened by vladmandic 3
Using other Pose Trackers instead of MediaPipe

Is there any particular reason to use MediaPipe? Why not use another method like VIBE, which directly gives output in the format of 3D rotation data for joints (instead of 3D Cartesian coordinates like MediaPipe). This way, there wouldn't be the need to perform the complex kinematic calculations which is the majority of the code of this kalidokit library.

opened by hshreeshail 2
[NEED HELP]: Why use screen space coordinates to calculate Euler angles?
Hi ~ Thanks for your great work ! i have encounter some problems when reading your source code. The main confusion is that the output landmarks from mediapipe FaceMesh is in Screen Coordinates as mention at face_mesh#face-transform-module, but you have use these landmarks to calculate Euler angles in your rollPitchYaw method. The pipeline for FaceSolver is look like:

scree coords 3D landmarks -> multiply imagesize(still screen coords) -> calcHead() -> rollPitchYaw() use scree coords

The landmarks detected by mediapipe is not in World Space nor Model Space, I just can not understand the calculation logic about the pipeline above. Can you tell me why this is valid? It will make a great help to me, many thanks ~
opened by DefTruth 2

npm run test fails with vite error

When running npm run test or npm run dev to start up the demo projects in the docs folder, vite fails to start the dev server, claiming a missing dependency in the Live2D folder.

Despite the error claiming that script.js could not be resolved, I've verified that the script.js file is present in docs/live2d, and I haven't touched it or index.html in any way. The only change I've made so far is removing the pre-build script from package.json to workaround #19

Platform: Windows 10

Steps to reproduce:

Clone Kalidokit repository
Remove prebuild script from package.json
Run "npm install"
Run "npm run test"

Full error output:

C:\Projects\Programming\kalidokit>npm run test

> [email protected] test C:\Projects\Programming\kalidokit
> npm run build && npm run dev


> [email protected] build C:\Projects\Programming\kalidokit
> vite build && npm run build:lib

vite v2.6.0 building for production...
Unknown input options: exports. Allowed options: acorn, acornInjectPlugins, cache, context, experimentalCacheExpiry, external, inlineDynamicImports, input, makeAbsoluteExternalsRelative, manualChunks, maxParallelFileReads, moduleContext, 
onwarn, perf, plugins, preserveEntrySignatures, preserveModules, preserveSymlinks, shimMissingExports, strictDeprecations, treeshake, watch
✓ 13 modules transformed.
dist/kalidokit.es.js   31.87 KiB / gzip: 6.99 KiB
dist/kalidokit.umd.js   16.10 KiB / gzip: 5.38 KiB

> [email protected] build:lib C:\Projects\Programming\kalidokit
> tsc -p .


> [email protected] dev C:\Projects\Programming\kalidokit
> vite docs


  vite v2.6.0 dev server running at:

error when starting dev server:
Error: The following dependencies are imported but could not be resolved:

  script.js (imported by C:/Projects/Programming/kalidokit/docs/live2d/index.html)

Are they installed?
    at optimizeDeps (C:\Projects\Programming\kalidokit\node_modules\vite\dist\node\chunks\dep-a8e01c9f.js:71146:15)     
    at processTicksAndRejections (internal/process/task_queues.js:93:5)
    at async runOptimize (C:\Projects\Programming\kalidokit\node_modules\vite\dist\node\chunks\dep-a8e01c9f.js:74997:48)
    at async Server.httpServer.listen (C:\Projects\Programming\kalidokit\node_modules\vite\dist\node\chunks\dep-a8e01c9f.js:75013:21)
npm ERR! code ELIFECYCLE
npm ERR! errno 1
npm ERR! [email protected] dev: `vite docs`
npm ERR! Exit status 1
npm ERR!
npm ERR! Failed at the [email protected] dev script.
npm ERR! This is probably not a problem with npm. There is likely additional logging output above.

opened by Renpona 2

Hollisitic Pose with ReadyPlayer Me Avatar

Hello, Thanks for your component that fill the gap between MediaPipe and Avatar Puppetering.

I made a small JSFiddle using ReadyPlayer Me Avatar with MediaPipe and Kalidokit https://jsfiddle.net/c0gxfk9y/1/

I only setup the Kalidokit.Pose.solve() I don't know if I correctly apply the pose Rig ? The animations seems OK but the arms are upside down.

I didn't apply the position/rotation on hips, because the avatar is moved anywhere. Next I'll add hand pose.

Can you point me in the right direction ? I feel close to something that would correctly do the job.

opened by JpEncausse 11
[BUG] Bug in Eyebrow calculation?

https://github.com/yeemachine/kalidokit/blob/c720d483df867996be96e2c3218e963da9190ecf/src/FaceSolver/calcEyes.ts#L244-L277

I have written a simple python version of your calEyes.ts and combine it with mediapipe's output directly. However, here in L260, the ratio is calculated as:browRatio = browDistance / maxBrowRatio - 1;. And browDistance is always smaller than maxBrowRatio, which means thatbrowRatiowill always be a negative number. This will cause that browRaiseRatio will always be 0 after clamp.

Is this a bug?

Update: I found out the this was due to different video input. When my face is close to the camera, the above problem will happen. However when I get to a middle distance (half body in the image). This will not happen. I don't know what caused this.

So the problem now is: browDistance will change over different video input. Do you have any clues on how to solve thiS? It is very weird that browDistance = eyeLidAvg / eyeWidth will have huge changes because the ratio should be the same for the same person. Should I use a cropped image for facial landmark extraction?

opened by ZhengdiYu 0
$Running kalidokit\docs\script.js$

Running kalidokit\docs\script.js

Hi.

I am trying to run kalidokit\docs\script.js in VSCode. Running into the following error after "npm install kalidokit" Any advice on this?

[Running] node "c:\Users\USER\kalidokit\docs\script.js" (node:11004) Warning: To load an ES module, set "type": "module" in the package.json or use the .mjs extension. (Usenode --trace-warnings ...` to show where the warning was created) c:\Users\USER\kalidokit\docs\script.js:1 import * as Kalidokit from "../dist"; ^^^^^^

SyntaxError: Cannot use import statement outside a module at Object.compileFunction (node:vm:360:18) at wrapSafe (node:internal/modules/cjs/loader:1084:15) at Module._compile (node:internal/modules/cjs/loader:1119:27) at Object.Module._extensions..js (node:internal/modules/cjs/loader:1209:10) at Module.load (node:internal/modules/cjs/loader:1033:32) at Function.Module._load (node:internal/modules/cjs/loader:868:12) at Function.executeUserEntryPoint [as runMain] (node:internal/modules/run_main:81:12) at node:internal/main/run_main_module:22:47`

opened by ss8319 6
how to load bvh or fbx file

is there any way to load bvh or fbx file ? and , is there some middlewear to save the face & body animation and drive the vrm asynchronously

opened by yslion 0

Kalidokit is a blendshape and kinematics solver for Mediapipe/Tensorflow.js face, eyes, pose, and hand tracking models

Related tags

Overview

KalidoKit - Face, Pose, and Hand Tracking Kinematics

Install

Via NPM

Via CDN

Methods

Remixable VRM Template with KalidoKit

Basic Usage

Slight differences with Mediapipe and Tensorflow.js

Outputs

Community Showcase

Open to Contributions

Comments

Improvement of Leg and Arm Rotation

Owner

Rich

Controlling a game using mediapipe hand tracking

A robotic arm that mimics hand movement through MediaPipe tracking.

MOT-Tracking-by-Detection-Pipeline - For Tracking-by-Detection format MOT (Multi Object Tracking), is it a framework that separates Detection and Tracking processes?

This is a Machine Learning Based Hand Detector Project, It Uses Machine Learning Models and Modules Like Mediapipe, Developed By Google!

I have created this Virtual Paint Program, in this you can paint(draw) on your screen using hand gestures, created in Python-3 using OpenCV and Mediapipe library. Gestures :- Index Finger for drawing and Index+Middle Finger for changing position and objects.

Some simple programs built in Python: webcam with cv2 that detects eyes and face, with grayscale filter

SE3 Pose Interp - Interpolate camera pose or trajectory in SE3, pose interpolation, trajectory interpolation

OpenCV, MediaPipe Pose Estimation, Affine Transform for Icon Overlay

This project contains an implemented version of Face Detection using OpenCV and Mediapipe. This is a code snippet and can be used in projects.

A python library for face detection and features extraction based on mediapipe library

Extracts essential Mediapipe face landmarks and arranges them in a sequenced order.

Hand-distance-measurement-game - Hand Distance Measurement Game

Face and Pose detector that emits MQTT events when a face or human body is detected and not detected.

img2pose: Face Alignment and Detection via 6DoF, Face Pose Estimation

Sudoku solver - A sudoku solver with python

Simple Linear 2nd ODE Solver GUI - A 2nd constant coefficient linear ODE solver with simple GUI using euler's method

An Inverse Kinematics library aiming performance and modularity

This folder contains the python code of UR5E's advanced forward kinematics model.