Tencent/ncnn 20210720 on GitHub

Prebuilt binaries, default configuration, built with android-ndk-r21d, Xcode 12.2, ubuntu-16.04, ubuntu-18.04, ubuntu-20.04, VS2015, VS2017, VS2019, emscripten-2.0.8

file	content	arch
ncnn-full-source.zip	full source code, including all submodule code
ncnn-android.zip	android static/shared library	armeabi-v7a + arm64-v8a + x86 + x86_64
ncnn-android-vulkan.zip	android static/shared library, GPU support	armeabi-v7a + arm64-v8a + x86 + x86_64
ncnn-ios.zip	ios static library, with and without bitcode	armv7 + arm64 + arm64e + i386 + x86_64
ncnn-ios-vulkan.zip	ios static library, GPU support, with and without bitcode	arm64 + arm64e + x86_64
ncnn-macos.zip	macos static library	x86_64 + arm64
ncnn-macos-vulkan.zip	macos static library, GPU support	x86_64 + arm64
ncnn-ubuntu.zip	ubuntu linux static/shared library, GPU support, model conversion tools	x86_64
ncnn-windows.zip	windows static/shared library, GPU support, model conversion tools	x86 + x86_64
ncnn-webassembly.zip	webassembly static library	wasm32 + simd + threads + simd-threads

Operator optimizations for the vector 0.7.1 and 1.0 extensions on risc-v CPUs, covering fp32/fp16, with support for variable vlen and runtime detection of the vector and half-precision extensions with function dispatch (absval, cast, clip, concat, convolution, convolutiondepthwise, crop, deconvolution, deconvolutiondepthwise, flatten, innerproduct, interp, mish, packing, padding, pooling, relu, sigmoid, swish, tanh, unaryop)
Operator optimizations for the msa extension on mips CPUs, with runtime detection of the msa extension and function dispatch (absval, bias, binaryop, clip, concat, convolution, convolutiondepthwise, crop, deconvolution, deconvolutiondepthwise, dropout, eltwise, flatten, hardsigmoid, hardswish, innerproduct, interp, mish, packing, padding, pooling, prelu, relu, sigmoid, slice, softmax, swish, tanh, unaryop)
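Runtime detection of avx, improving performance on avx-only platforms (Timen)
Support for building on the loongarch64 architecture (tsuibin)
Always enable armv8.2 dot on apple devices
Examples can be built with simpleocv, without depending on opencv
Improved building with clang under visual studio (Timen)
New cmake option NCNN_BF16, which can disable all bf16-related code to reduce library size
Major update to the operators documentation
fma optimization of the arm neon math functions
arm neon tanh optimization (deepage)
risc-v vector optimization of AbsVal/ReLU (thelastlin)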
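Fixed a calculation error in x86 requantize pack4to1
Fixed a calculation error in innerproduct gemm int8 (lsdustc)
Fixed the wrong uv transform matrix in warpaffine_bilinear_yuv420sp (DaydreamCoding)
Fixed an out-of-bounds read in yuv420sp2rgb on armv7 (zchrissirhcz)
Fixed a memory leak when encoding vulkan push_constant (chenxiemin)
darknet2ncnn supports the sam layer and letter_box detection (zhiliu6)
Fixed a darknet2ncnn conversion error with pad=0 (zhiliu6)
Fixed ncnn2table misparsing long numbers on the command line
Improved ncnn2table multithreading efficiency
ncnn2table supports dynamic input (jinmingyi1998)
Fixed a file index error in the ncnn2table easyquant method (lsdustc)
Fixed ncnnoptimize dropping interp parameters (jinmingyi1998)
Fixed a possible segfault in ncnnoptimize after replacing a conv operator with fc
Fixed a possible segfault in ncnnoptimize when it encounters an isolated node
Fixed the macos ci swiftshader build
Fixed the TypeError assertion in test_extractor.py (zhiqwang)
Fixed build warnings on macos (proydakov)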
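risc-v ci upgraded to qemu 6.0.0 with rvv extension support
ci now filters on changed files to reduce triggering
Added the c906 toolchain
Updated the Chinese documentation for building with visual studio (zchrissirhcz)
Fixed a video loading error in the yolov4 example (uniartisan)
Added the pocky group number to the readme (JuYanYan)
Added scrfd to the readme (ncnnnnn)
Added issue templates (tpoisonooo)
Assorted typo fixes (hwdef)
benchmark: added the nanodet_m model (BUG1989)
benchmark: added v1605b data (kalcohol)
benchmark: added loongson 2k1000 data
benchmark: updated jetson agx data (zineos)
Added a code formatting ci and disabled restyled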
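
Several items in this list mention runtime detection of a CPU extension followed by function dispatch. The general pattern can be sketched in C++ as below. This is an illustration only and is not taken from the ncnn source: `CpuFeatures`, `select_relu`, `apply_relu`, `relu_generic` and `relu_unrolled` are hypothetical names, the feature flag is supplied by the caller rather than queried from the hardware, and the "optimized" kernel is a plain unrolled loop standing in for real rvv, msa or avx code.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical feature flags. A real library would fill these in once at
// startup by querying the OS or the CPU; here the caller sets them by hand.
struct CpuFeatures {
    bool has_vector_ext; // stands for rvv, msa, avx, ...
};

// Portable reference kernel: clamp negative values to zero.
static void relu_generic(float* data, std::size_t n) {
    for (std::size_t i = 0; i < n; i++) {
        if (data[i] < 0.f) data[i] = 0.f;
    }
}

// Stand-in for an extension-specific kernel. It handles four elements per
// outer iteration and finishes the tail one element at a time, which is the
// usual shape of a SIMD loop, but it uses no intrinsics.
static void relu_unrolled(float* data, std::size_t n) {
    std::size_t i = 0;
    for (; i + 4 <= n; i += 4) {
        for (std::size_t k = 0; k < 4; k++) {
            if (data[i + k] < 0.f) data[i + k] = 0.f;
        }
    }
    for (; i < n; i++) {
        if (data[i] < 0.f) data[i] = 0.f;
    }
}

using ReluFn = void (*)(float*, std::size_t);

// Dispatch: choose the kernel once, based on the detected features.
static ReluFn select_relu(const CpuFeatures& features) {
    return features.has_vector_ext ? relu_unrolled : relu_generic;
}

// Convenience wrapper that dispatches and runs the kernel in place.
static void apply_relu(std::vector<float>& values, const CpuFeatures& features) {
    assert(!values.empty() || values.data() == nullptr || values.size() == 0);
    select_relu(features)(values.data(), values.size());
}
```

Calling `apply_relu(values, CpuFeatures{true})` and `apply_relu(values, CpuFeatures{false})` should leave the same result in `values`; only the kernel that runs differs. Selecting through a function pointer keeps a single binary usable on CPUs with and without the extension, which is the point of runtime rather than compile-time detection.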

New Contributors

@JuYanYan made their first contribution in #2956
@uniartisan made their first contribution in #3005
@hwdef made their first contribution in #3045
@DaydreamCoding made their first contribution in #3048
@sdli1995 made their first contribution in #3081
@chenxiemin made their first contribution in #3088
@tsuibin made their first contribution in #3094

Full Changelog: 2021052...2021072

Tencent/ncnn 20210720 android ios macos linux windows webassembly prebuilt libraries 20210720 2c4ae09 on GitHub
