Project Denver
General information | |
---|---|
Launched | 2014 (Denver) 2016 (Denver 2) |
Designed by | Nvidia |
Cache | |
L1 cache | 192 KiB per core (128 KiB I-cache with parity, 64 KiB D-cache with ECC) |
L2 cache | 2 MiB @ 2 cores |
Architecture and classification | |
Technology node | 28 nm (Denver 1) to 16 nm (Denver 2) |
Instruction set | ARMv8-A |
Physical specifications | |
Cores |
|
General information | |
---|---|
Launched | 2018 |
Designed by | Nvidia |
Max. CPU clock rate | to 2.3 GHz |
Cache | |
L1 cache | 192 KiB per core (128 KiB I-cache with parity, 64 KiB D-cache with ECC) |
L2 cache | 2 MiB @ 2 cores |
L3 cache | (4 MiB @ 8 cores, T194[1]) |
Architecture and classification | |
Technology node | 12 nm |
Instruction set | ARMv8.2-A |
Physical specifications | |
Cores |
|
- For the Soviet HIV disinformation campaign, see Operation Denver.
Project Denver is the codename of a central processing unit designed by Nvidia that implements the ARMv8-A 64/32-bit instruction sets using a combination of simple hardware decoder and software-based binary translation (dynamic recompilation) where "Denver's binary translation layer runs in software, at a lower level than the operating system, and stores commonly accessed, already optimized code sequences in a 128 MB cache stored in main memory".[2] Denver is a very wide in-order superscalar pipeline. Its design makes it suitable for integration with other SIPs cores (e.g. GPU, display controller, DSP, image processor, etc.) into one die constituting a system on a chip (SoC).
Project Denver is targeted at mobile computers, personal computers, servers, as well as supercomputers.[3] Respective cores have found integration in the Tegra SoC series from Nvidia. Initially Denver cores was designed for the 28 nm process node (Tegra model T132 aka "Tegra K1"). Denver 2 was an improved design that built for the smaller, more efficient 16 nm node. (Tegra model T186 aka "Tegra X2").
In 2018, Nvidia released an improved design (codename: "Carmel", based on ARMv8 (64-bit; variant: ARM-v8.2[1] with 10-way superscalar, functional safety, dual execution, parity & ECC) got integrated into the Tegra Xavier SoC offering a total of 8 cores (or 4 dual-core pairs).[4][failed verification] The Carmel CPU core supports full Advanced SIMD (ARM NEON), VFP (Vector Floating Point), and ARMv8.2-FP16.[1] First published testings of Carmel cores integrated in the Jetson AGX development kit by third party experts took place in September 2018 and indicated a noticeably increased performance as should expected for this real world physical manifestation compared to predecessors systems, despite all doubts the used quickness of such a test setup in general an in particular implies.[5] The Carmel design can be found in the Tegra model T194 ("Tegra Xavier") that is designed with a 12 nm structure size.
Overview
[edit]- Pipelined in-order superscalar processor
- 2-way decoder for ARM instructions
- On-the-fly binary translation of ARM code into internal VLIW instructions by hardware translator, uses software emulation as fallback
- Translation can reorder ARM instructions, and remove ones that do not contribute to the result[2]
- Up to 7 micro-ops per clock cycle with translated VLIW instructions; cannot run simultaneously with ARM decoder
- L1 cache: 128 KiB instruction + 64 KiB data per core (4-way set associative)
- 2 MiB shared L2 cache between two Denver cores (16-way set-associative)[6]
- Denver also sets aside 128 MiB of main memory to store translated VLIW code; this part of memory is inaccessible to the main operating system.
- Up to 2.5 GHz clockspeeds on TSMC 28 nm process[7]
Chips
[edit]A dual-core Denver CPU was paired with a Kepler-based GPU solution to form the Tegra K1; the dual-core 2.3 GHz Denver-based K1 was first used in the HTC Nexus 9 tablet, released November 3, 2014.[8][9] Note, however, that the quad-core Tegra K1, while using the same name, isn't based on Denver.
The Nvidia Tegra X2 has two Denver2 cores paired with four Cortex-A57 cores using a coherent HMP (Heterogeneous Multi-Processor Architecture) approach.[10] They are paired with a Pascal GPU.
The Tegra Xavier has a Volta GPU and several special purpose accelerators. The 8 Carmel CPU cores is divided into 4 ASIC macro blocks (each having 2 cores,) matched to each other with a crossbar and 4 MiB of shared L3 memory.
History
[edit]The existence of Project Denver was revealed at the 2011 Consumer Electronics Show.[11] In a March 4, 2011 Q&A article CEO Jen-Hsun Huang revealed that Project Denver is a five-year 64-bit ARMv8-A architecture CPU development on which hundreds of engineers had already worked for three and half years and which also has 32-bit ARM instruction set (ARMv7) backward compatibility.[12] Project Denver was started in Stexar Company (Colorado) as an x86-compatible processor using binary translation, similar to projects by Transmeta. Stexar was acquired by Nvidia in 2006.[13][14][15]
According to Tom's Hardware, there are engineers from Intel, AMD, HP, Sun and Transmeta on the Denver team, and they have extensive experience designing superscalar CPUs with out-of-order execution, very long instruction words (VLIW) and simultaneous multithreading (SMT).[16]
According to Charlie Demerjian, the Project Denver CPU may internally translate the ARM instructions to an internal instruction set, using firmware in the CPU.[17] Also according to Demerjian, Project Denver was originally intended to support both ARM and x86 code using code morphing technology from Transmeta, but was changed to the ARMv8-A 64-bit instruction set because Nvidia could not obtain a license to Intel's patents.[17]
The first consumer device shipping with Denver CPU cores, Google's Nexus 9, was announced on October 15, 2014. The tablet was manufactured by HTC and features the dual-core Tegra K1 SoC. The Nexus 9 was the first 64-bit Android device available to consumers.[18]
See also
[edit]References
[edit]- ^ a b c NVIDIA Jetson AGX Xavier Delivers 32 TeraOps for New Era of AI in Robotics by Dustin Franklin (Nvidia development team for Jetson), December 12, 2018
- ^ a b Wasson, Scott (August 11, 2014). "Nvidia claims Haswell-class performance for Denver CPU core". The Tech Report. Retrieved August 14, 2014.
- ^ Dally, Bill (January 5, 2011). ""PROJECT DENVER" PROCESSOR TO USHER IN NEW ERA OF COMPUTING". Official Nvidia blog.
- ^ NVIDIA Drive Xavier SOC Detailed by Hassan Mujtaba on Jan 8, 2018 via WccfTech
- ^ "A Quick Test of NVIDIA's "Carmel" CPU Performance".
- ^ Hachman, Mark (August 11, 2014). "Nvidia reveals PC-like performance for 'Denver' Tegra K1". PC World. Retrieved September 19, 2014.
- ^ Anthony, Sebastian (January 6, 2014). "Tegra K1 64-bit Denver core analysis: Are Nvidia's x86 efforts hidden within?". ExtremeTech. Retrieved January 7, 2014.
- ^ "Nexus 9 storms through Geekbench, Tegra K1 outperforms Apple iPhone 6's A8". 16 October 2014.
- ^ Shimpi, Anand (January 5, 2014). "NVIDIA Announces Tegra K1 SoC with Optional Denver CPU Cores". Anandtech. Archived from the original on January 7, 2014. Retrieved January 6, 2014.
- ^ NVIDIA Unveils Tegra Parker SOC at Hot Chips – Built on 16nm TSMC Process, Features Pascal and Denver 2 Duo Architecture, August 22, 2016
- ^ http://www.nvidia.com/object/ces2011.html Nvidia's press conference webcast
- ^ Takahashi, Dean (March 4, 2011). "Q&A: Nvidia chief explains his strategy for winning in mobile computing".
- ^ Valich, Theo (December 12, 2011). "NVIDIA Project Denver "Lost in Rockies", to Debut in 2014-15".
- ^ Miller, Paul (October 19, 2006). "NVIDIA has x86 CPU in the works?". Engadget. Retrieved October 19, 2013.
- ^ Valich, Theo (March 20, 2013). "New Tegra Roadmap Reveals Logan, Parker and Kayla CUDA Strategy".
- ^ Parrish, Kevin (October 14, 2013). "64-bit Nvidia Tegra 6 "Parker" Chip May Arrive in 2014. Devices with a 64-bit Tegra 6 could launch before the end of 2014". Tom's Hardware & ExtremeTech. Retrieved October 19, 2013.
- ^ a b Demerjian, Charlie (August 5, 2011). "What is Project Denver based on?". Semiaccurate.
- ^ Amadeo, Ron (October 15, 2014). "Google announces Nexus 6, Nexus 9, Nexus Player, and Android 5.0 Lollipop".
External links
[edit]- Valich, Theo (September 20, 2012). "NVIDIA Project Boulder Revealed: Tegra's Competitor Hides in GPU Group".
- Gwennap, Linley (August 18, 2014). "Nvidia's First CPU Is a Winner. Denver Uses Dynamic Translation to Outperform Mobile Rivals" (PDF). MPR, Linley Group.