KSAI Lite is a deep learning inference framework of kingsoft, based on tensorflow lite

Last update: Dec 27, 2022

Related tags

Deep Learning KSAI-Lite

Overview

KSAI Lite

English | 简体中文

KSAI Lite是一个轻量级、灵活性强、高性能且易于扩展的深度学习推理框架，底层基于tensorflow lite，定位支持包括移动端、嵌入式以及服务器端在内的多硬件平台。

当前KSAI Lite已经应用在金山office内部业务中，并逐步支持金山企业的生产任务和众多外部用户。

快速入门

使用KSAI Lite，只需几个简单的步骤，就可以把模型部署到多种终端设备中，运行高性能的推理任务，使用流程如下所示：

一. 准备模型

KSAI Lite框架直接支持模型结构为tflite模型。如果您手中的模型是由诸如Caffe、MXNet、PyTorch等框架产出的，那么您可以使用工具将模型转换为tflite格式。

二. 模型优化

KSAI Lite框架基于底层tensorflow lite的优化方法，拥有优秀的加速、优化策略及实现，包含量化、子图融合、Kernel优选等优化手段。优化后的模型更轻量级，耗费资源更少，并且执行速度也更快。

三. 下载或编译

KSAI Lite提供了多平台的官方Release预测库下载，我们优先推荐您直接下载 KSAI Lite预编译库,包括了Linux-X64, Linux-ARM, Linux-MIPS64以及Windows-X64索引库和Windows-X64动态链接库。您也可以根据目标平台选择对应的源码编译方法。KSAI Lite 提供了源码编译脚本，位于 tools/目录下，只需要按照docs/目录下的准备环境说明文档environment setup.md搭建好环境然后切到tools/目录调用编译脚本两个步骤即可一键编译得到目标平台的KSAI Lite预测库。

四. 预测示例

KSAI Lite提供了C++ API，并且提供了相应API的完整使用示例: 目录为tensorflow/lite/examples/reg_test/reg_test.cc 您可以参考示例快速了解使用方法，并集成到您自己的项目中去，也可以参考KSAI-Toolkits该项目。

主要特性

多硬件支持
- KSAI Lite架构已经验证和完整支持从 Mobile 到 Server 多种硬件平台，包括 intel X86、ARM、华为 Kunpeng 920、龙芯Loongson-3A R3、兆芯C4600、Phytium FT1500a等，且正在不断增加更多新硬件支持。
轻量级部署：
- KSAI Lite在设计上对图优化模块和执行引擎实现了良好的解耦拆分，移动端可以直接部署执行阶段，无任何第三方依赖。
高性能
- 极致的 ARM及X86 CPU 性能优化：针对不同微架构特点实现kernel的定制，最大发挥计算性能，在主流模型上展现出领先的速度优势。
多模型多算子：
- KSAI Lite和tensorflow训练框架的OP对齐，提供广泛的模型支持能力。
- 目前已对视觉类模型做到了较为充分的支持，覆盖分类、检测和识别，包含了特色的OCR模型的支持，并在不断丰富中。
强大的图分析和优化能力：
- 不同于常规的移动端预测引擎基于 Python 脚本工具转化模型， Lite 架构上有完整基于 C++ 开发的 IR 及相应 Pass 集合，以支持操作融合，计算剪枝，存储优化，量化计算等多类计算图优化。

持续集成

System	X86 Linux	ARM Linux	MIPS64 Linux	windows
CPU(32bit)		-	-
CPU(64bit)		-	-
高通骁龙845	-		-	-
华为kunpeng920	-		-	-
龙芯Loongson-3A	-	-		-
兆芯C4600	-		-	-
Phytium FT1500a	-		-	-

交流与反馈

欢迎您通过Github Issues来提交问题、报告与建议

版权和许可证

KSAI-Lite由Apache-2.0 license提供

You might also like...

Torchserve server using a YoloV5 model running on docker with GPU and static batch inference to perform production ready inference.

Yolov5 running on TorchServe (GPU compatible) ! This is a dockerfile to run TorchServe for Yolo v5 object detection model. (TorchServe (PyTorch librar

82 Nov 29, 2022

Monocular 3D pose estimation. OpenVINO. CPU inference or iGPU (OpenCL) inference.

human-pose-estimation-3d-python-cpp RealSenseD435 (RGB) 480x640 + CPU Corei9 45 FPS (Depth is not used) 1. Run 1-1. RealSenseD435 (RGB) 480x640 + CPU

8 Oct 3, 2022

Data-depth-inference - Data depth inference with python

Welcome! This readme will guide you through the use of the code in this reposito

3 Feb 8, 2022

Curvlearn, a Tensorflow based non-Euclidean deep learning framework.

English | 简体中文 Why Non-Euclidean Geometry Considering these simple graph structures shown below. Nodes with same color has 2-hop distance whereas 1-ho

123 Dec 12, 2022

⚡ Fast • 🪶 Lightweight • 0️⃣ Dependency • 🔌 Pluggable • 😈 TLS interception • 🔒 DNS-over-HTTPS • 🔥 Poor Man's VPN • ⏪ Reverse & ⏩ Forward • 👮🏿 "Proxy Server" framework • 🌐 "Web Server" framework • ➵ ➶ ➷ ➠ "PubSub" framework • 👷 "Work" acceptor & executor framework

Table of Contents Features Install Using PIP Stable version Development version Using Docker Stable version Development version Using HomeBrew Stable

2.2k Jan 8, 2023

A bare-bones TensorFlow framework for Bayesian deep learning and Gaussian process approximation

Aboleth A bare-bones TensorFlow framework for Bayesian deep learning and Gaussian process approximation [1] with stochastic gradient variational Bayes

127 Dec 12, 2022

TensorFlow implementation of "Variational Inference with Normalizing Flows"

[TensorFlow 2] Variational Inference with Normalizing Flows TensorFlow implementation of "Variational Inference with Normalizing Flows" [1] Concept Co

7 Jun 8, 2022

Ivy is a templated deep learning framework which maximizes the portability of deep learning codebases.

Ivy is a templated deep learning framework which maximizes the portability of deep learning codebases. Ivy wraps the functional APIs of existing frameworks. Framework-agnostic functions, libraries and layers can then be written using Ivy, with simultaneous support for all frameworks. Ivy currently supports Jax, TensorFlow, PyTorch, MXNet and Numpy. Check out the docs for more info!

8.2k Jan 2, 2023

Python package facilitating the use of Bayesian Deep Learning methods with Variational Inference for PyTorch

PyVarInf PyVarInf provides facilities to easily train your PyTorch neural network models using variational inference. Bayesian Deep Learning with Vari

342 Dec 2, 2022

KSAI Lite is a deep learning inference framework of kingsoft, based on tensorflow lite

Related tags

Overview

KSAI Lite

快速入门

主要特性

持续集成

交流与反馈

版权和许可证

You might also like...

Torchserve server using a YoloV5 model running on docker with GPU and static batch inference to perform production ready inference.

Monocular 3D pose estimation. OpenVINO. CPU inference or iGPU (OpenCL) inference.

Data-depth-inference - Data depth inference with python

Curvlearn, a Tensorflow based non-Euclidean deep learning framework.

A bare-bones TensorFlow framework for Bayesian deep learning and Gaussian process approximation

TensorFlow implementation of "Variational Inference with Normalizing Flows"

Ivy is a templated deep learning framework which maximizes the portability of deep learning codebases.

Python package facilitating the use of Bayesian Deep Learning methods with Variational Inference for PyTorch

Owner

A lightweight face-recognition toolbox and pipeline based on tensorflow-lite

Python scripts to detect faces in Python with the BlazeFace Tensorflow Lite models

A tutorial showing how to train, convert, and run TensorFlow Lite object detection models on Android devices, the Raspberry Pi, and more!

Example scripts for the detection of lanes using the ultra fast lane detection model in Tensorflow Lite.

Python scripts form performing stereo depth estimation using the HITNET model in Tensorflow Lite.

Swapping face using Face Mesh with TensorFlow Lite

Python script for performing depth completion from sparse depth and rgb images using the msg_chn_wacv20. model in Tensorflow Lite.

Python scripts for performing stereo depth estimation using the MobileStereoNet model in Tensorflow Lite.

MACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.