[FAQ] Nvidia répond à vos questions, RTX Ampère compris (anglais uniquement) sur le forum Hardware - 05-09-2020 15:45:46

Pseudo supprimé

Niveau 10

05 septembre 2020 à 15:45:46

Ce sujet concernera uniquement la FAQ Q/A avec Nvidia, plusieurs questions ont été posées, dont notamment celles liées avec la nouvelle génération RTX Ampère et ainsi celle en rapport avec la compatibilité du PCI-E 3.0 et 4.0 (mais vous l'avez sûrement déjà vue).

Évidemment tout est en anglais, il y a trop de choses à traduire mais la traduction automatique est largement compréhensible et si vous n'avez pas Chrome prenez simplement google Traduction il y a très peu d'erreurs. Je les mets directement ici pour ceux qui ont une connexion lente.

RTX Ampère

Why only 10 GB of memory for RTX 3080 ? How was that determined to be a sufficient number, when it is stagnant from the previous generation ?

SpoilAfficherMasquer

[Justin Walker] We’re constantly analyzing memory requirements of the latest games and regularly review with game developers to understand their memory needs for current and upcoming games. The goal of 3080 is to give you great performance at up to 4k resolution with all the settings maxed out at the best possible price.

In order to do this, you need a very powerful GPU with high speed memory and enough memory to meet the needs of the games. A few examples - if you look at Shadow of the Tomb Raider, Assassin’s Creed Odyssey, Metro Exodus, Wolfenstein Youngblood, Gears of War 5, Borderlands 3 and Red Dead Redemption 2 running on a 3080 at 4k with Max settings (including any applicable high res texture packs) and RTX On, when the game supports it, you get in the range of 60-100fps and use anywhere from 4GB to 6GB of memory.

Extra memory is always nice to have but it would increase the price of the graphics card, so we need to find the right balance.

When the slide says RTX 3070 is equal or faster than 2080 Ti, are we talking about traditional rasterization or DLSS/RT workloads ? Very important if you could clear it up, since no traditional rasterization benchmarks were shown, only RT/DLSS supporting games.

SpoilAfficherMasquer

[Justin Walker] We are talking about both. Games that only support traditional rasterization and games that support RTX (RT+DLSS).

Does Ampere support HDMI 2.1 with the full 48Gbps bandwidth ?

SpoilAfficherMasquer

[Qi Lin] Yes. The NVIDIA Ampere Architecture supports the highest HDMI 2.1 link rate of 12Gbs/lane across all 4 lanes, and supports Display Stream Compression (DSC) to be able to power up to 8K, 60Hz in HDR.

Could you elaborate a little on this doubling of CUDA cores ? How does it affect the general architectures of the GPCs ? How much of a challenge is it to keep all those FP32 units fed? What was done to ensure high occupancy ?

SpoilAfficherMasquer

[Tony Tamasi] One of the key design goals for the Ampere 30-series SM was to achieve twice the throughput for FP32 operations compared to the Turing SM. To accomplish this goal, the Ampere SM includes new datapath designs for FP32 and INT32 operations. One datapath in each partition consists of 16 FP32 CUDA Cores capable of executing 16 FP32 operations per clock. Another datapath consists of both 16 FP32 CUDA Cores and 16 INT32 Cores. As a result of this new design, each Ampere SM partition is capable of executing either 32 FP32 operations per clock, or 16 FP32 and 16 INT32 operations per clock. All four SM partitions combined can execute 128 FP32 operations per clock, which is double the FP32 rate of the Turing SM, or 64 FP32 and 64 INT32 operations per clock.

Doubling the processing speed for FP32 improves performance for a number of common graphics and compute operations and algorithms. Modern shader workloads typically have a mixture of FP32 arithmetic instructions such as FFMA, floating point additions (FADD), or floating point multiplications (FMUL), combined with simpler instructions such as integer adds for addressing and fetching data, floating point compare, or min/max for processing results, etc. Performance gains will vary at the shader and application level depending on the mix of instructions. Ray tracing denoising shaders are good examples that might benefit greatly from doubling FP32 throughput.

Doubling math throughput required doubling the data paths supporting it, which is why the Ampere SM also doubled the shared memory and L1 cache performance for the SM. (128 bytes/clock per Ampere SM versus 64 bytes/clock in Turing). Total L1 bandwidth for GeForce RTX 3080 is 219 GB/sec versus 116 GB/sec for GeForce RTX 2080 Super.

Like prior NVIDIA GPUs, Ampere is composed of Graphics Processing Clusters (GPCs), Texture Processing Clusters (TPCs), Streaming Multiprocessors (SMs), Raster Operators (ROPS), and memory controllers.

The GPC is the dominant high-level hardware block with all of the key graphics processing units residing inside the GPC. Each GPC includes a dedicated Raster Engine, and now also includes two ROP partitions (each partition containing eight ROP units), which is a new feature for NVIDIA Ampere Architecture GA10x GPUs. More details on the NVIDIA Ampere architecture can be found in NVIDIA’s Ampere Architecture White Paper, which will be published in the coming days.

Any idea if the dual airflow design is going to be messed up for inverted cases? More than previous designs? Seems like it would blow it down on the cpu. But the CPU cooler would still blow it out the case. Maybe it’s not so bad.
Second question. 10x quieter than the Titan for the 3090 is more or less quieter than a 2080 Super (Evga ultra fx for example)?

SpoilAfficherMasquer

[Qi Lin] The new flow through cooling design will work great as long as chassis fans are configured to bring fresh air to the GPU, and then move the air that flows through the GPU out of the chassis. It does not matter if the chassis is inverted.

The Founders Edition RTX 3090 is quieter than both the Titan RTX and the Founders Edition RTX 2080 Super. We haven’t tested it against specific partner designs, but I think you’ll be impressed with what you hear… or rather, don’t hear.

Will the 30 series cards be supporting 10bit 444 120fps ? Traditionally Nvidia consumer cards have only supported 8bit or 12bit output, and don’t do 10bit. The vast majority of hdr monitors/TVs on the market are 10bit.

SpoilAfficherMasquer

[Qi Lin] The 30 series supports 10bit HDR. In fact, HDMI 2.1 can support up to 8K@60Hz with 12bit HDR, and that covers 10bit HDR displays.

What breakthrough in tech let you guys massively jump to the 3xxx line from the 2xxx line? I knew it would be scary, but it's insane to think about how much more efficient and powerful these cards are. Can these cards handle 4k 144hz?

SpoilAfficherMasquer

[Justin Walker] There were major breakthroughs in GPU architecture, process technology and memory technology to name just a few. An RTX 3080 is powerful enough to run certain games maxed out at 4k 144fps - Doom Eternal, Forza 4, Wolfenstein Youngblood to name a few. But others - Red Dead Redemption 2, Control, Borderlands 3 for example are closer to 4k 60fps with maxed out settings.

What kind of advancements can we expect from DLSS? Most people were expecting a DLSS 3.0, or, at the very least, something like DLSS 2.1. Are you going to keep improving DLSS and offer support for more games while maintaining the same version?

SpoilAfficherMasquer

DLSS SDK 2.1 is out and it includes three updates:

- New ultra performance mode for 8K gaming. Delivers 8K gaming on GeForce RTX 3090 with a new 9x scaling option.

- VR support. DLSS is now supported for VR titles.

- Dynamic resolution support. The input buffer can change dimensions from frame to frame while the output size remains fixed. If the rendering engine supports dynamic resolution, DLSS can be used to perform the required upscale to the display resolution.

How bad would it be to run the 3080 off of a split connector instead of two separate cable. would it be potentially dangerous to the system if I’m not overclocking?

SpoilAfficherMasquer

The recommendation is to run two individual cables. There’s a diagram here. https://www.nvidia.com/en-us/geforce/graphics-cards/30-series/rtx-3080/?nvmid=systemcomp

RTX IO

Could we see RTX IO coming to machine learning libraries such as Pytorch? This would be great for performance in real-time applications

SpoilAfficherMasquer

[Tony Tamasi] NVIDIA delivered high-speed I/O solutions for a variety of data analytics platforms roughly a year ago with NVIDIA GPU DirectStorage. It provides for high-speed I/O between the GPU and storage, specifically for AI and HPC type applications and workloads. For more information please check out: https://developer.nvidia.ia.com/blog/gpudirect-storage/

Does RTX IO allow use of SSD space as VRAM? Or am I completely misunderstanding?

SpoilAfficherMasquer

[Tony Tamasi] RTX IO allows reading data from SSD’s at much higher speed than traditional methods, and allows the data to be stored and read in a compressed format by the GPU, for decompression and use by the GPU. It does not allow the SSD to replace frame buffer memory, but it allows the data from the SSD to get to the GPU, and GPU memory much faster, with much less CPU overhead.

Will there be a certain ssd speed requirement for RTX I/O?

SpoilAfficherMasquer

[Tony Tamasi] There is no SSD speed requirement for RTX IO, but obviously, faster SSD’s such as the latest generation of Gen4 NVMe SSD’s will produce better results, meaning faster load times, and the ability for games to stream more data into the world dynamically. Some games may have minimum requirements for SSD performance in the future, but those would be determined by the game developers. RTX IO will accelerate SSD performance regardless of how fast it is, by reducing the CPU load required for I/O, and by enabling GPU-based decompression, allowing game assets to be stored in a compressed format and offloading potentially dozens of CPU cores from doing that work. Compression ratios are typically 2:1, so that would effectively amplify the read performance of any SSD by 2x.

Will the new GPUs and RTX IO work on Windows 7/8.1?

SpoilAfficherMasquer

[Tony Tamasi] RTX 30-series GPUs are supported on Windows 7 and Windows 10, RTX IO is supported on Windows 10.

I am excited for the RTX I/O feature but I partially don't get how exactly it works? Let's say I have a NVMe SSD, a 3070 and the latest Nvidia drivers, do I just now have to wait for the windows update with the DirectStorage API to drop at some point next year and then I am done or is there more?

SpoilAfficherMasquer

[Tony Tamasi] RTX IO and DirectStorage will require applications to support those features by incorporating the new API’s. Microsoft is targeting a developer preview of DirectStorage for Windows for game developers next year, and NVIDIA RTX gamers will be able to take advantage of RTX IO enhanced games as soon as they become available.

Pseudo supprimé

Niveau 10

05 septembre 2020 à 15:45:57

cocoy72

Niveau 29

05 septembre 2020 à 16:03:27

impeccable, beau boulo, et très interessent.
merci

Kobalt1001

Niveau 30

05 septembre 2020 à 16:23:26

Merci pour ce topic qui va mettre un point final au débat pci 3.0 ou 4.0.
C'est bien pour dans 2, 3ans max.

2GisColorful

Niveau 19

05 septembre 2020 à 16:30:58

Le 05 septembre 2020 à 16:23:26 kobalt1001 a écrit :
Merci pour ce topic qui va mettre un point final au débat pci 3.0 ou 4.0.
C'est bien pour dans 2, 3ans max.

Les SSD saturant le 3.0 sont déjà là

Diablo77190

Niveau 8

05 septembre 2020 à 16:36:14

elles sont dispo à quel heur ?

mvppaulo

Niveau 48

05 septembre 2020 à 16:46:34

Pour la deuxième question du coup, en fait le graphique qui met la 3070 au niveau de la 2080ti c'est purement pour les performances RTX/DLSS et ça concerne pas les performances classiques en fps ? Je suis pas sûr d'avoir compris

BeaujolaisS

Niveau 7

05 septembre 2020 à 16:49:18

Le 05 septembre 2020 à 16:46:34 Mvppaulo a écrit :
Pour la deuxième question du coup, en fait le graphique qui met la 3070 au niveau de la 2080ti c'est purement pour les performances RTX/DLSS et ça concerne pas les performances classiques en fps ? Je suis pas sûr d'avoir compris

C'est le contraire ! Ils ont dit pour les deux.

Sarag0s

Niveau 10

05 septembre 2020 à 17:01:08

Donc ils confirme que la 3070 est bien égal voir supérieur à une 2080 ti si je comprends bien.

Sinon merci pour ce lot d'information.

Majorhap

Niveau 24

05 septembre 2020 à 17:03:28

Pseudo supprimé

Niveau 10

05 septembre 2020 à 17:09:26

Le 05 septembre 2020 à 16:23:26 kobalt1001 a écrit :
Merci pour ce topic qui va mettre un point final au débat pci 3.0 ou 4.0.
C'est bien pour dans 2, 3ans max.

Ouais en gros si tu fais une config futur proof dès maintenant tu prends un gros Ryzen 4000, une 3080 et un SSD PCIe4 et t'es tranquille quoi, et les gains arriveront quand ils arriveront

BeaujolaisS

Niveau 7

05 septembre 2020 à 17:38:28

Le 05 septembre 2020 à 17:33:59 [Howaito] a écrit :
Le 05 septembre 2020 à 16:36:14 diablo77190 a écrit :
elles sont dispo à quel heur ?
Généralement c'est à 15h chez nous. Après la date je ne sais plus.

Le CM d'Nvidia a dit à 6 AM PST, ce qui correspond bien à 15h chez nous. Et c'est le 17 pour la date.

bboy-mat

Niveau 21

05 septembre 2020 à 17:58:34

Propre l'article très intéressant notamment le passage concernant Nvidia reflex.
Étant gros joueur de fps competitifs notamment valorant j'attends cette option avec impatience !

Theboy_maybe

Niveau 10

05 septembre 2020 à 18:03:45

[Justin Walker] We are talking about both. Games that only support traditional rasterization and games that support RTX (RT+DLSS).

Plus de débat possible

Bourlingue

Niveau 10

05 septembre 2020 à 18:06:06

Possible de traduire svp ? (je sais utiliser google trad, mais je voudrais être sur de bien tout comprendre)

MightyDucky

Niveau 16

05 septembre 2020 à 18:18:19

Merci pour ces réponses.

Honnêtement vous en pensez quoi de leur réponse sur la VRAM ? Ça se passe comment si dans deux ans la "vraie" next gen arrive et qu'une 3070 se retrouve en pls ?

BulkJVC

Niveau 41

05 septembre 2020 à 18:31:04

Le 05 septembre 2020 à 18:18:19 MightyDucky a écrit :
Merci pour ces réponses.
Honnêtement vous en pensez quoi de leur réponse sur la VRAM ? Ça se passe comment si dans deux ans la "vraie" next gen arrive et qu'une 3070 se retrouve en pls ?

Bah on baissera quelques options graphiques.

Squall1er

Niveau 43

05 septembre 2020 à 19:27:25

Je up, c'est insupportable, comment ce genre de topic passe en deuxième page ouaich

tontonfomoso2

Niveau 12

05 septembre 2020 à 20:39:04

Pourquoi il publie reflex sur la gen 900?

il nous ont bien skip pour le freesync
quant faut racheter du matos (moniteur) la il publie leur techno pour les ancienne carte

DjRaoult

Niveau 10

05 septembre 2020 à 23:26:26

La réponse qu'il donne à propos du fait que le refroidissement de la FE envoi l'air chaud dans la gueule du CPU me laisse perplexe

The new flow through cooling design will work great as long as chassis fans are configured to bring fresh air to the GPU, and then move the air that flows through the GPU out of the chassis.

En gros il dit que le refroidissement de la CG fonctionnera très bien si on a des ventilateurs en intake qui envoient l'air frais à l'intérieur du boitier vers la CG (Ok cool, c'est un flux d'air classique)
Mais par contre il ignore complètement la partie qui parle de l'air chaud expulser dans le ventirad du CPU
Si ces cartes grimpe à 70 / 80° en jeu, je pense que l'air qui s'en dégagera va être plus que tiède et avoir ce flux d'air chaud constant aspiré par le ventirad, ça va pas être top

Informatique

Hardware