 Dynamic Model in TVMmodel in TVM ● Support Any-dim in typing ● Use shape function to compute the type at runtime ● Virtual machine as a new runtime for Relay ● Dynamic codegen (WIP) ○ Kernel dispatch for a single op ○ Data dependent© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Relay virtual machine Relay Executable relay.vm.compile Relay Object (hardware independent) Code segment VM a data type using the entries from a register. AllocClosure Allocates a closure with a lowered virtual machine function. If Jumps to the true or false offset depending on the condition. Goto Unconditionally0 码力 | 24 页 | 417.46 KB | 5 月前3 Dynamic Model in TVMmodel in TVM ● Support Any-dim in typing ● Use shape function to compute the type at runtime ● Virtual machine as a new runtime for Relay ● Dynamic codegen (WIP) ○ Kernel dispatch for a single op ○ Data dependent© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Relay virtual machine Relay Executable relay.vm.compile Relay Object (hardware independent) Code segment VM a data type using the entries from a register. AllocClosure Allocates a closure with a lowered virtual machine function. If Jumps to the true or false offset depending on the condition. Goto Unconditionally0 码力 | 24 页 | 417.46 KB | 5 月前3
 OctoML OSS 2019 11 8areas: o Core Infrastructure Improvements to TVM o_uTVM: support for microcontrollers in TVM o_ Virtual Machine and dynamic NNs support (w/ AWS folks) o_ Improved NLP support, with focus on transformers o QQ octoML BERT has many reshape operations, which are currently implemented using copy, 10 Virtual Machine e Many improvements from contributors at UW, AWS, and OctoML. e Initial implementation0 码力 | 16 页 | 1.77 MB | 5 月前3 OctoML OSS 2019 11 8areas: o Core Infrastructure Improvements to TVM o_uTVM: support for microcontrollers in TVM o_ Virtual Machine and dynamic NNs support (w/ AWS folks) o_ Improved NLP support, with focus on transformers o QQ octoML BERT has many reshape operations, which are currently implemented using copy, 10 Virtual Machine e Many improvements from contributors at UW, AWS, and OctoML. e Initial implementation0 码力 | 16 页 | 1.77 MB | 5 月前3
 Trends Artificial Intelligence
Erica Virtual Assistant (6/18) Note: We assume a start at zero users from Erica’s launch in 6/18. Pilot users excluded. Source: Bank of America (2/21, 4/24, 2/25) Bank of America Erica Virtual Assistant 10/21 3/22 8/22 1/23 6/23 11/23 4/24 9/24 2/25 Cumulative Client Interactions with Erica Virtual Assistant (MM) Note: Erica is a conversational AI built into Bank of America’s mobile app that helps0 码力 | 340 页 | 12.14 MB | 4 月前3 Trends Artificial Intelligence
Erica Virtual Assistant (6/18) Note: We assume a start at zero users from Erica’s launch in 6/18. Pilot users excluded. Source: Bank of America (2/21, 4/24, 2/25) Bank of America Erica Virtual Assistant 10/21 3/22 8/22 1/23 6/23 11/23 4/24 9/24 2/25 Cumulative Client Interactions with Erica Virtual Assistant (MM) Note: Erica is a conversational AI built into Bank of America’s mobile app that helps0 码力 | 340 页 | 12.14 MB | 4 月前3
 TVM: Where Are We Goingremote_mod = remote.load_module(“mylib.so") func = remote_mod[“npufunction0"] func(remote_a, remote_b)Virtual Machine: Supporting Dynamic Workload Dynamic shape workloads More runtime objects: Arrays, Tuples0 码力 | 31 页 | 22.64 MB | 5 月前3 TVM: Where Are We Goingremote_mod = remote.load_module(“mylib.so") func = remote_mod[“npufunction0"] func(remote_a, remote_b)Virtual Machine: Supporting Dynamic Workload Dynamic shape workloads More runtime objects: Arrays, Tuples0 码力 | 31 页 | 22.64 MB | 5 月前3
 PAI & TVM Meetup - Shanghai 20191116buffer to hide memory load latency 。 storage align to reduce bank conflicts of shared memory 。 Virtual threads for data reuse (on going) Performance on V100 (FP16) 计算平台事业部 COMPUTING PLATFORM 512, 160 码力 | 26 页 | 5.82 MB | 5 月前3 PAI & TVM Meetup - Shanghai 20191116buffer to hide memory load latency 。 storage align to reduce bank conflicts of shared memory 。 Virtual threads for data reuse (on going) Performance on V100 (FP16) 计算平台事业部 COMPUTING PLATFORM 512, 160 码力 | 26 页 | 5.82 MB | 5 月前3
 OpenAI - AI in the EnterpriseProduct Note: Operator Operator is an example of OpenAI’s agentic approach. Leveraging its own virtual browser, Operator can navigate the web, click on buttons, fill in forms, and gather data just like0 码力 | 25 页 | 9.48 MB | 5 月前3 OpenAI - AI in the EnterpriseProduct Note: Operator Operator is an example of OpenAI’s agentic approach. Leveraging its own virtual browser, Operator can navigate the web, click on buttons, fill in forms, and gather data just like0 码力 | 25 页 | 9.48 MB | 5 月前3
共 6 条
- 1













