 keras tutorialalgorithm (CNN, RNN, etc.,) can be represented in a simple and efficient manner. The following diagram depicts the relationship between model, layer and core modules: Let us see the overview of Keras0 码力 | 98 页 | 1.57 MB | 1 年前3 keras tutorialalgorithm (CNN, RNN, etc.,) can be represented in a simple and efficient manner. The following diagram depicts the relationship between model, layer and core modules: Let us see the overview of Keras0 码力 | 98 页 | 1.57 MB | 1 年前3
 《Efficient Deep Learning Book》[EDL] Chapter 1 - Introductionduring inference, inference latency, etc. Using the sensitive tweet classifier example, during the deployment phase the user will be concerned about the inference efficiency and should be aware of what is expenditure on their data-centers, hence any efficiency gains are very significant. Enabling On-Device Deployment With the advent of smartphones, Internet-of-Things (IoT) devices (refer to Figure 1-5 for the deployed, to learn from a larger more accurate model (teacher) which might not be suitable for deployment. The larger model is used to generate soft labels on the training data, and the student model learns0 码力 | 21 页 | 3.17 MB | 1 年前3 《Efficient Deep Learning Book》[EDL] Chapter 1 - Introductionduring inference, inference latency, etc. Using the sensitive tweet classifier example, during the deployment phase the user will be concerned about the inference efficiency and should be aware of what is expenditure on their data-centers, hence any efficiency gains are very significant. Enabling On-Device Deployment With the advent of smartphones, Internet-of-Things (IoT) devices (refer to Figure 1-5 for the deployed, to learn from a larger more accurate model (teacher) which might not be suitable for deployment. The larger model is used to generate soft labels on the training data, and the student model learns0 码力 | 21 页 | 3.17 MB | 1 年前3
 《Efficient Deep Learning Book》[EDL] Chapter 3 - Learning TechniquesNow, let’s say that the performance threshold for a given model to be considered feasible for deployment is a classification accuracy of 80%. Any additional improvement is not required, and we would prefer acceptable for deployment (it meets 80% accuracy target). Whereas, among the models with the learning techniques, four models with the smallest being the 150 KB model are valid deployment candidates. Learning performance when compared to a single model, but now we have multiple models which also multiplies our deployment costs. Hinton et al.18, in their seminal work explored how smaller student networks can be taught0 码力 | 56 页 | 18.93 MB | 1 年前3 《Efficient Deep Learning Book》[EDL] Chapter 3 - Learning TechniquesNow, let’s say that the performance threshold for a given model to be considered feasible for deployment is a classification accuracy of 80%. Any additional improvement is not required, and we would prefer acceptable for deployment (it meets 80% accuracy target). Whereas, among the models with the learning techniques, four models with the smallest being the 150 KB model are valid deployment candidates. Learning performance when compared to a single model, but now we have multiple models which also multiplies our deployment costs. Hinton et al.18, in their seminal work explored how smaller student networks can be taught0 码力 | 56 页 | 18.93 MB | 1 年前3
 PyTorch Brand Guidelinesmachine learning framework that accelerates the path from research prototyping to production deployment. Learn more at PyTorch0 码力 | 12 页 | 34.16 MB | 1 年前3 PyTorch Brand Guidelinesmachine learning framework that accelerates the path from research prototyping to production deployment. Learn more at PyTorch0 码力 | 12 页 | 34.16 MB | 1 年前3
 华为云深度学习在文本分类中的实践-李明磊embedding Classification Matching Wordpiece Keras tokenizer Jieba Hanlp Model Saving Deployment Testing Vocab Sequence labeling Huawei tokenizer word2vec Elmo pb ckpt H5 (Keras) RESTful0 码力 | 23 页 | 1.80 MB | 1 年前3 华为云深度学习在文本分类中的实践-李明磊embedding Classification Matching Wordpiece Keras tokenizer Jieba Hanlp Model Saving Deployment Testing Vocab Sequence labeling Huawei tokenizer word2vec Elmo pb ckpt H5 (Keras) RESTful0 码力 | 23 页 | 1.80 MB | 1 年前3
 微博在线机器学习和深度学习实践-黄波在线机器学习-模型服务部署 • 模型评估 • 模型上线部署前指标评估 • 周期使用验证样本进行点击率预估 • 待部署模型与线上模型进行指标对比,评估是否满足上线条件 • 一键部署 • 基于K8S的deployment模式,一键端口分配与模型服务部署 • 基于ZK的服务发现,一键进行流量灰度与发布 • 性能优化 • 通信优化:特征请求与模型计算单元化,在线样本格式压缩 • 计算优化:基于SSE/AVX0 码力 | 36 页 | 16.69 MB | 1 年前3 微博在线机器学习和深度学习实践-黄波在线机器学习-模型服务部署 • 模型评估 • 模型上线部署前指标评估 • 周期使用验证样本进行点击率预估 • 待部署模型与线上模型进行指标对比,评估是否满足上线条件 • 一键部署 • 基于K8S的deployment模式,一键端口分配与模型服务部署 • 基于ZK的服务发现,一键进行流量灰度与发布 • 性能优化 • 通信优化:特征请求与模型计算单元化,在线样本格式压缩 • 计算优化:基于SSE/AVX0 码力 | 36 页 | 16.69 MB | 1 年前3
 《Efficient Deep Learning Book》[EDL] Chapter 6 - Advanced Learning Techniques - Technical ReviewFigure 6-7: An example of few-shot learning with a large language model. One of the prominent deployment of such models is the GitHub’s Copilot software9 where GPT-3 is used for auto-completing code snippets0 码力 | 31 页 | 4.03 MB | 1 年前3 《Efficient Deep Learning Book》[EDL] Chapter 6 - Advanced Learning Techniques - Technical ReviewFigure 6-7: An example of few-shot learning with a large language model. One of the prominent deployment of such models is the GitHub’s Copilot software9 where GPT-3 is used for auto-completing code snippets0 码力 | 31 页 | 4.03 MB | 1 年前3
 《Efficient Deep Learning Book》[EDL] Chapter 2 - Compression Techniqueshappens that after training a model with decent accuracy, the environmental constraints restrict the deployment of the solution for practical purposes. What we want for the reader to take away after reading0 码力 | 33 页 | 1.96 MB | 1 年前3 《Efficient Deep Learning Book》[EDL] Chapter 2 - Compression Techniqueshappens that after training a model with decent accuracy, the environmental constraints restrict the deployment of the solution for practical purposes. What we want for the reader to take away after reading0 码力 | 33 页 | 1.96 MB | 1 年前3
 《Efficient Deep Learning Book》[EDL] Chapter 4 - Efficient Architecturesand bigger models. Although they improved the quality of the solutions, the bigger models posed deployment challenges. What good is a model that cannot be deployed in practical applications! Efficient0 码力 | 53 页 | 3.92 MB | 1 年前3 《Efficient Deep Learning Book》[EDL] Chapter 4 - Efficient Architecturesand bigger models. Although they improved the quality of the solutions, the bigger models posed deployment challenges. What good is a model that cannot be deployed in practical applications! Efficient0 码力 | 53 页 | 3.92 MB | 1 年前3
 动手学深度学习 v2.0'Gumbel', 'HalfCauchy', 'HalfNormal', (continues on next page) 43 https://en.wikipedia.org/wiki/Venn_diagram 44 https://en.wikipedia.org/wiki/Markov_chain 45 https://discuss.d2l.ai/t/1762 2.7. 查阅文档 81 (continued0 码力 | 797 页 | 29.45 MB | 1 年前3 动手学深度学习 v2.0'Gumbel', 'HalfCauchy', 'HalfNormal', (continues on next page) 43 https://en.wikipedia.org/wiki/Venn_diagram 44 https://en.wikipedia.org/wiki/Markov_chain 45 https://discuss.d2l.ai/t/1762 2.7. 查阅文档 81 (continued0 码力 | 797 页 | 29.45 MB | 1 年前3
共 10 条
- 1













