Guia prático RISC-V
Atlas de uma Arquitetura Aberta
Primeira edição, 1.0.0clock de 1 GHz significa que o tempo por ciclo de clock é de 1 ns (1/109). Por exemplo, para o benchmark CoreMark [Gal-On and Levy 2012] (100.000 iterações), o desempenho no ARM-32 Cortex-A9 é 32.27 B hoje ante- cipam os resultados dos desvios usando preditores de hardware, que podem exceder a pre- cisão de 90% e trabalhar com qualquer tamanho de pipeline. Eles precisam apenas de um mecanismo para liberar Berkeley, 2015. S. Gal-On and M. Levy. Exploring CoreMark - a benchmark maximizing simplicity and efficacy. The Embedded Microprocessor Benchmark Consortium, 2012. Intel Corporation. Intel 64 and IA-32 Architectures0 码力 | 215 页 | 21.77 MB | 1 年前3
julia 1.10.10(we here use @noinline to prevent the opti- mizer from trying to be too clever and defeat our benchmark): @noinline function inner(x, y) s = zero(eltype(x)) for i=eachindex(x) @inbounds s += x[i]*y[i] – in general, the speedup will be smaller. (In this particular example, the working set of the benchmark is small enough to fit into the L1 cache of the processor, so that memory access latency does not Base.sin – Method. sin(x) Compute sine of x, where x is in radians. See also sind, sinpi, sincos, cis, asin. Examples julia> round.(sin.(range(0, 2pi, length=9)'), digits=3) 1×9 Matrix{Float64}: 00 码力 | 1692 页 | 6.34 MB | 3 月前3
Julia 1.10.9(we here use @noinline to prevent the opti- mizer from trying to be too clever and defeat our benchmark): @noinline function inner(x, y) s = zero(eltype(x)) for i=eachindex(x) @inbounds s += x[i]*y[i] – in general, the speedup will be smaller. (In this particular example, the working set of the benchmark is small enough to fit into the L1 cache of the processor, so that memory access latency does not Base.sin – Method. sin(x) Compute sine of x, where x is in radians. See also sind, sinpi, sincos, cis, asin. Examples julia> round.(sin.(range(0, 2pi, length=9)'), digits=3) 1×9 Matrix{Float64}: 00 码力 | 1692 页 | 6.34 MB | 3 月前3
Julia 1.8.0 DEV Documentationmarkup (we here use @noinline to prevent the optimizer from trying to be too clever and defeat our benchmark): CHAPTER 34. PERFORMANCE TIPS 393 @noinline function inner(x, y) s = zero(eltype(x)) for i=eachindex(x) – in general, the speedup will be smaller. (In this particular example, the working set of the benchmark is small enough to fit into the L1 cache of the processor, so that memory access latency does not See also [sind], [sinpi], [sincos], [cis]. source Base.cos – Method. cos(x) Compute cosine of x, where x is in radians. See also [cosd], [cospi], [sincos], [cis]. source Base.Math.sincos – Method0 码力 | 1463 页 | 5.01 MB | 1 年前3
Julia 1.11.4(we here use @noinline to prevent the opti- mizer from trying to be too clever and defeat our benchmark): @noinline function inner(x, y) s = zero(eltype(x)) for i=eachindex(x) @inbounds s += x[i]*y[i] – in general, the speedup will be smaller. (In this particular example, the working set of the benchmark is small enough to fit into the L1 cache of the processor, so that memory access latency does not Compute sine of x, where x is in radians.CHAPTER 44. MATHEMATICS 850 See also sind, sinpi, sincos, cis, asin. Examples julia> round.(sin.(range(0, 2pi, length=9)'), digits=3) 1×9 Matrix{Float64}: 00 码力 | 2007 页 | 6.73 MB | 3 月前3
Julia 1.11.5 Documentation(we here use @noinline to prevent the opti- mizer from trying to be too clever and defeat our benchmark): @noinline function inner(x, y) s = zero(eltype(x)) for i=eachindex(x) @inbounds s += x[i]*y[i] – in general, the speedup will be smaller. (In this particular example, the working set of the benchmark is small enough to fit into the L1 cache of the processor, so that memory access latency does not Compute sine of x, where x is in radians.CHAPTER 44. MATHEMATICS 850 See also sind, sinpi, sincos, cis, asin. Examples julia> round.(sin.(range(0, 2pi, length=9)'), digits=3) 1×9 Matrix{Float64}: 00 码力 | 2007 页 | 6.73 MB | 3 月前3
Julia v1.9.4 Documentation(we here use @noinline to prevent the opti- mizer from trying to be too clever and defeat our benchmark): @noinline function inner(x, y) s = zero(eltype(x)) for i=eachindex(x) @inbounds s += x[i]*y[i] – in general, the speedup will be smaller. (In this particular example, the working set of the benchmark is small enough to fit into the L1 cache of the processor, so that memory access latency does not Base.sin – Method. sin(x) Compute sine of x, where x is in radians. See also sind, sinpi, sincos, cis, asin. Examples julia> round.(sin.(range(0, 2pi, length=9)'), digits=3) 1×9 Matrix{Float64}: 00 码力 | 1644 页 | 5.27 MB | 1 年前3
Julia 1.9.0 DEV Documentation(we here use @noinline to prevent the opti- mizer from trying to be too clever and defeat our benchmark): @noinline function inner(x, y) s = zero(eltype(x)) for i=eachindex(x) @inbounds s += x[i]*y[i] – in general, the speedup will be smaller. (In this particular example, the working set of the benchmark is small enough to fit into the L1 cache of the processor, so that memory access latency does not Base.sin – Method. sin(x) Compute sine of x, where x is in radians. See also sind, sinpi, sincos, cis, asin. Examples julia> round.(sin.(range(0, 2pi, length=9)'), digits=3) 1×9 Matrix{Float64}: 00 码力 | 1633 页 | 5.23 MB | 1 年前3
Julia 1.9.3 Documentation(we here use @noinline to prevent the opti- mizer from trying to be too clever and defeat our benchmark): @noinline function inner(x, y) s = zero(eltype(x)) for i=eachindex(x) @inbounds s += x[i]*y[i] – in general, the speedup will be smaller. (In this particular example, the working set of the benchmark is small enough to fit into the L1 cache of the processor, so that memory access latency does not Base.sin – Method. sin(x) Compute sine of x, where x is in radians. See also sind, sinpi, sincos, cis, asin. Examples julia> round.(sin.(range(0, 2pi, length=9)'), digits=3) 1×9 Matrix{Float64}: 00 码力 | 1644 页 | 5.27 MB | 1 年前3
Julia 1.9.0 rc2 Documentation(we here use @noinline to prevent the opti- mizer from trying to be too clever and defeat our benchmark): @noinline function inner(x, y) s = zero(eltype(x)) for i=eachindex(x) @inbounds s += x[i]*y[i] – in general, the speedup will be smaller. (In this particular example, the working set of the benchmark is small enough to fit into the L1 cache of the processor, so that memory access latency does not Base.sin – Method. sin(x) Compute sine of x, where x is in radians. See also sind, sinpi, sincos, cis, asin. Examples julia> round.(sin.(range(0, 2pi, length=9)'), digits=3) 1×9 Matrix{Float64}: 00 码力 | 1644 页 | 5.27 MB | 1 年前3
共 497 条
- 1
- 2
- 3
- 4
- 5
- 6
- 50













