Skip to content

Commit 09b111a

Browse files
committed
[MNN:Sync] Sync Internal 3.1.4.
1 parent 16bc909 commit 09b111a

File tree

98 files changed

+18022
-10939
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

98 files changed

+18022
-10939
lines changed

LICENSE.txt

Lines changed: 0 additions & 201 deletions
This file was deleted.

README.md

Lines changed: 6 additions & 13 deletions
Original file line numberDiff line numberDiff line change
@@ -1,18 +1,12 @@
11
![MNN](doc/banner.png)
2-
---
3-
[![License](https://img.shields.io/github/license/alibaba/MNN)](LICENSE.txt)
4-
[![Documentation](https://img.shields.io/badge/Documentation-Read-green)](https://mnn-docs.readthedocs.io/en/latest/)
5-
[![中文版本](https://img.shields.io/badge/Language-%E7%AE%80%E4%BD%93%E4%B8%AD%E6%96%87-green)](README_CN.md)
6-
[![日本語バージョン](https://img.shields.io/badge/Language-%E6%97%A5%E6%9C%AC%E8%AA%9E-green)](README_JP.md)
7-
[![MNN Homepage](https://img.shields.io/badge/Homepage-Visit-green)](http://www.mnn.zone)
2+
3+
[中文版本](README_CN.md)
4+
5+
[日本語バージョン](README_JP.md)
6+
7+
[MNN Homepage](http://www.mnn.zone)
88

99
## News 🔥
10-
- [2025/04/30] android app support qwen3 and dark mode [MnnLlmChat](./apps/Android/MnnLlmChat/README.md#version-040).
11-
<p align="center">
12-
<img width="20%" alt="Icon" src="https://meta.alicdn.com/data/mnn/qwen_3.gif" style="margin: 0 10px;">
13-
</p>
14-
<details>
15-
<summary> History News </summary>
1610
- [2025/02/18] iOS multimodal LLM App is released [MNN LLM iOS](./apps/iOS/MNNLLMChat/README.md).
1711
<p align="center">
1812
<img width="20%" alt="Icon" src="./apps/iOS/MNNLLMChat/assets/introduction.gif" style="margin: 0 10px;">
@@ -30,7 +24,6 @@
3024
<img width="20%" alt="Icon" src="./apps/Android/MnnLlmChat/assets/image_sound.jpg" style="margin: 0 10px;">
3125
<img width="20%" alt="Icon" src="./apps/Android/MnnLlmChat/assets/image_image.jpg" style="margin: 0 10px;">
3226
</p>
33-
</details>
3427

3528
## Intro
3629
MNN is a highly efficient and lightweight deep learning framework. It supports inference and training of deep learning models and has industry-leading performance for inference and training on-device. At present, MNN has been integrated into more than 30 apps of Alibaba Inc, such as Taobao, Tmall, Youku, DingTalk, Xianyu, etc., covering more than 70 usage scenarios such as live broadcast, short video capture, search recommendation, product searching by image, interactive marketing, equity distribution, security risk control. In addition, MNN is also used on embedded devices, such as IoT.

README_CN.md

Lines changed: 9 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -1,11 +1,10 @@
11
![MNN](doc/banner.png)
2-
---
3-
[![License](https://img.shields.io/github/license/alibaba/MNN)](LICENSE.txt)
4-
[![Documentation](https://img.shields.io/badge/Documentation-Read-green)](https://mnn-docs.readthedocs.io/en/latest/)
5-
[![English Version](https://img.shields.io/badge/Language-English-green)](README.md)
6-
[![日本語バージョン](https://img.shields.io/badge/Language-%E6%97%A5%E6%9C%AC%E8%AA%9E-green)](README_JP.md)
7-
[![MNN Homepage](https://img.shields.io/badge/Homepage-Visit-green)](http://www.mnn.zone)
82

3+
[English Version](README.md)
4+
5+
[日本語バージョン](README_JP.md)
6+
7+
[MNN Homepage](http://www.mnn.zone)
98

109
[MNN](https://github.com/alibaba/MNN)是一个轻量级的深度神经网络引擎,支持深度学习的推理与训练。适用于服务器/个人电脑/手机/嵌入式各类设备。目前,MNN已经在阿里巴巴的手机淘宝、手机天猫、优酷等30多个App中使用,覆盖直播、短视频、搜索推荐、商品图像搜索、互动营销、权益发放、安全风控等场景。
1110

@@ -41,15 +40,15 @@ MNN文档:
4140

4241
## 整体特点
4342

44-
### 轻量性
43+
### 轻量性
4544

46-
- 主体功能(模型推理CPU+GPU)无任何依赖,代码精简,可以方便地部署到移动设备和各种嵌入式设备中。
45+
- 主体功能(模型推理CPU+GPU)无任何依赖,代码精简,可以方便地部署到移动设备和各种嵌入式设备中。
4746
- iOS平台:功能全开的MNN静态库 armv7+arm64大小12MB左右,链接生成可执行文件增加大小2M左右。可裁剪主体功能后静态库大小6.1M ,链接生成可执行文件增加大小 600 KB。
4847
- Android平台:主体功能 armv7a - c++_shared 动态库大小800KB左右。
4948
- 支持采用 Mini 编辑选项进一步降低包大小,大约能在上述库体积基础上进一步降低 25% 左右。
5049
- 支持模型FP16/Int8压缩与量化,可减少模型50% - 75% 的体积
5150

52-
### 通用性
51+
### 通用性
5352

5453
- 支持 Tensorflow、Caffe、ONNX、Torchscripts 等主流模型文件格式,支持CNN / RNN / GAN / Transformer 等主流网络结构。
5554
- 支持多输入多输出,支持任意维度的输入输出,支持动态输入(输入大小可变),支持带控制流的模型
@@ -110,7 +109,7 @@ MNN适配的硬件架构与精度详见下表:
110109
## 社区交流与反馈
111110
钉钉群组:
112111

113-
- 钉钉群1:23329087
112+
- 钉钉群1:23329087
114113
- 钉钉群2:23350225
115114
- 钉钉群3:扫描二维码加入
116115

README_JP.md

Lines changed: 10 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -1,18 +1,18 @@
11
![MNN](doc/banner.png)
2-
---
3-
[![License](https://img.shields.io/github/license/alibaba/MNN)](LICENSE.txt)
4-
[![Documentation](https://img.shields.io/badge/Documentation-Read-green)](https://mnn-docs.readthedocs.io/en/latest/)
5-
[![English Version](https://img.shields.io/badge/Language-English-green)](README.md)
6-
[![中文版本](https://img.shields.io/badge/Language-%E7%AE%80%E4%BD%93%E4%B8%AD%E6%96%87-green)](README_CN.md)
7-
[![MNN ホームページ](https://img.shields.io/badge/Homepage-Visit-green)](http://www.mnn.zone)
2+
3+
[English Version](README.md)
4+
5+
[中文版本](README_CN.md)
6+
7+
[MNN ホームページ](http://www.mnn.zone)
88

99
## ニュース 🔥
1010
- [2025/01/23] フルマルチモーダル LLM Android アプリをリリースしました: [MNN-LLM-Android](./project/android/apps/MnnLlmApp/README.md)。テキストからテキスト、画像からテキスト、音声からテキスト、テキストから画像生成を含みます。
1111
<p align="center">
12-
<img width="20%" alt="Icon" src="./apps/Android/MnnLlmChat/assets/image_home.jpg" style="margin: 0 10px;">
13-
<img width="20%" alt="Icon" src="./apps/Android/MnnLlmChat/assets/image_diffusion.jpg" style="margin: 0 10px;">
14-
<img width="20%" alt="Icon" src="./apps/Android/MnnLlmChat/assets/image_sound.jpg" style="margin: 0 10px;">
15-
<img width="20%" alt="Icon" src="./apps/Android/MnnLlmChat/assets/image_image.jpg" style="margin: 0 10px;">
12+
<img width="20%" alt="Icon" src="./project/android/apps/MnnLlmApp/assets/image_home.jpg" style="margin: 0 10px;">
13+
<img width="20%" alt="Icon" src="./project/android/apps/MnnLlmApp/assets/image_diffusion.jpg" style="margin: 0 10px;">
14+
<img width="20%" alt="Icon" src="./project/android/apps/MnnLlmApp/assets/image_sound.jpg" style="margin: 0 10px;">
15+
<img width="20%" alt="Icon" src="./project/android/apps/MnnLlmApp/assets/image_image.jpg" style="margin: 0 10px;">
1616
</p>
1717

1818
## 紹介

backupcode/cpubackend/compute/GemmInt8Executor.cpp

Lines changed: 3 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -36,12 +36,11 @@ static void _makeResource(Backend* backend, std::shared_ptr<CPUConvolution::Reso
3636
auto alphaPtr = resource->mDequantize.mScaleBias->host<float>();
3737
auto biasPtr = reinterpret_cast<float*>(reinterpret_cast<uint8_t*>(alphaPtr) + ocUp4 * core->bytes);
3838
::memset(alphaPtr, 0, 2 * ocUp4 * core->bytes);
39-
auto wZero = resourceInt8->mWeightQuantZero->host<int32_t>(); // has packed to outputUp4
4039
auto wScale = resourceInt8->mOriginScale->host<float>();
4140
int h = ocUp4;
4241
for (int i=0; i< h; ++i) {
4342
alphaPtr[i] = wScale[i];
44-
biasPtr[i] = (-1.f) * wZero[i] * wScale[i];
43+
biasPtr[i] = wScale[i + ocUp4];
4544
}
4645
}
4746

@@ -185,8 +184,8 @@ ErrorCode GemmInt8Executor::onExecute(const std::vector<Tensor *> &inputs, const
185184

186185
quanParam.useInt8 = 0; // Save result as float data type.
187186
quanParam.biasFloat = reinterpret_cast<float*>(mQuantBias.data());
188-
quanParam.weightQuanBias = mKernelSum.data();
189-
quanParam.extraScale = nullptr;
187+
quanParam.weightKernelSum = mKernelSum.data();
188+
quanParam.inputScale = nullptr;
190189
float dequantScale = mMutableResource.mResource->mInputScale;
191190

192191
SumByAxisParams sumParams;

0 commit comments

Comments
 (0)