CSDN is too hard to use... I'll be moving to Zhihu from now on...
Literature reading:
Dynamic-OFA: Runtime DNN Architecture Switching for Performance Scaling on Heterogeneous Embedded Platforms
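This paper uses a lookup table (LUT) for run-time management. Offline, an accuracy predictor and a latency predictor estimate the accuracy and latency of each sub-network, and the results are stored in the LUT. Online, at runtime, the runtime manager reads the LUT and picks a configuration that meets the current requirements. A configuration here means a model with a particular accuracy and latency; the paper calls these different levels.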
All in all, there are two steps: offline evaluation and online scheduling.
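To make the two steps concrete, here is a minimal sketch of my own, not the paper's code. It assumes the LUT is just a list of entries sorted by predicted latency, and that the runtime manager picks the most accurate entry that fits a latency budget. The names `LutEntry`, `build_lut`, and `select_config`, as well as all numbers in the toy example, are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Sequence


@dataclass(frozen=True)
class LutEntry:
    level: int         # index of the configuration, 0 = fastest
    subnet_id: str     # identifier of the pre-sampled sub-network
    accuracy: float    # predicted accuracy (%)
    latency_ms: float  # predicted latency (ms)


def build_lut(
    subnets: Sequence[str],
    predict_accuracy: Callable[[str], float],
    predict_latency: Callable[[str], float],
) -> List[LutEntry]:
    """Offline step: run both predictors once per sub-network and store the results."""
    rows = [(s, predict_accuracy(s), predict_latency(s)) for s in subnets]
    rows.sort(key=lambda r: r[2])  # order by predicted latency, fastest first
    return [
        LutEntry(level=i, subnet_id=s, accuracy=acc, latency_ms=lat)
        for i, (s, acc, lat) in enumerate(rows)
    ]


def select_config(
    lut: Sequence[LutEntry],
    max_latency_ms: float,
    min_accuracy: float = 0.0,
) -> Optional[LutEntry]:
    """Online step: only table lookups, no predictor calls at runtime."""
    feasible = [
        e for e in lut
        if e.latency_ms <= max_latency_ms and e.accuracy >= min_accuracy
    ]
    if not feasible:
        return None  # no configuration satisfies the constraints
    return max(feasible, key=lambda e: e.accuracy)


# Toy example with made-up predictor outputs (assumption, not from the paper).
toy_accuracy = {"small": 70.0, "medium": 75.0, "large": 78.0}
toy_latency = {"small": 10.0, "medium": 20.0, "large": 40.0}

lut = build_lut(
    ["large", "small", "medium"],
    toy_accuracy.__getitem__,
    toy_latency.__getitem__,
)
choice = select_config(lut, max_latency_ms=25.0)
print(choice)
```

The point of the split is that the expensive part (running the predictors) happens once offline, so the online manager only has to scan a small table when the latency budget changes.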
- Pre-sample a family of sub-networks from a static OFA (Once-for-All) network, and use a runtime manager to choose among these sub-networks under different runtime environments.
-