Intel's 45nm 'Nehalem' processor design will incorporate the second generation of the chip maker's SSE 4 technology. For now, the company is calling the post-'Penryn' Streaming SIMD Extensions instruction set SSE 4.2. Nehalem's implementation of SSE 4 essentially matches that of Penryn. The key additions centre on the …

COMMENTS

House rules Send corrections

This topic is closed for new posts.

Tuesday 18th March 2008 13:49 GMT Anonymous Coward

Logic behind HT

I'm not sure I get that one. Hyperthreading was very useful prior to the appearance of multi-core CPUs, yet it scaled pretty badly. It added performance to 1 or 2 CPU machines, however you wouldn't see any gain in performance with 4 CPUs.

I just wonder how that's going to work - and actually improve performance with Quad Core CPUs.

0 0
Wednesday 19th March 2008 03:06 GMT Anonymous Coward

Re: Logic behind HT

HT allows two threads to utilize different areas of the execution core. Hypothetically, one could have a thread running an integer operation, while another runs a floating point operation simultaneously. You are right, that there is less to gain from HT in a multi-core CPU, but I suppose it's like Intel says "on getting the job finished more quickly, the better to improve power efficiency." To me HT isn't that important, and it doesn't bother me one bit that my Core 2 Duo doesn't have it, but I guess it doesn't hurt to have it either.

0 0
Wednesday 19th March 2008 08:33 GMT Anonymous Coward

Re :Re: Logic behind HT

HT works as you suggest however there is more to it than you suggest.

Each core up till now had 3 seperate instruction pipelines. On a single thread the instructions for the thread would be analysed and then split so that independent ones (that don't rely on the previous result) would be run on different execution cores. I believe this also happened with branches in some cases where both sides of the branch would be evaluated whilst it was waiting for the result of the branch test to come back.

On net burst architecture because the execution pipeline was very long - so to get branch results back took a long time. Also splitting up the instructions into independent ones was harder because it took so much longer to get results back, making the dependency chain longer. As a result of this quite a lot of the time one or more of the execution cores were sat idle. So HT could take advantage of this because the second thread is always independent from the first and so the instructions could be interleaved.

However the newer Core architectures have a much shorter execution pipeline so there was less idle space in the execution cores to be taken advantage of by hyper threading. Now though they're adding a fourth execution core which they must feel means there's enough spare slots on the cores to support another thread.

This also should mean for power management that if the CPU/OS detects that two threads run happily on the same core then additional cores can be shut down, saving on power usage.

Also as we write more and more core/thread hungry apps then HT will be just as useful for multi-core as it was for single core. It's also probably pretty handy in a server environment too where the cores are often IO bound and so having more hardware threads is a bigger win.

0 0
Wednesday 19th March 2008 23:28 GMT John Robinson

Forget Nehalem ...

Intel still haven't managed to get the previous generation into production, despite promises. Where is the Q9450, for instance? Has ANYone got one, apart from engineering samples?

0 0

This topic is closed for new posts.

Topics

Special Features

Vendor Voice

Resources

User topics

Article topics

User topics

Article topics

Intel tweaks SSE 4 to speed text processing

COMMENTS

Logic behind HT

Re: Logic behind HT

Re :Re: Logic behind HT

Forget Nehalem ...

Other stories you might like

Kaby Lake-G chip back from the grave, now on modest firewall-router-NAS mobo

Meteor Lake CPUs splash down in socketed motherboards for edge and embedded workloads

Intel over the Moon as Lunar Lake’s NPU performance TOPS Meteor Lake

Intel Gaudi's third and final hurrah is an AI accelerator built to best Nvidia's H100

Intel Meteor Lake makes unexpected leap to socketed motherboards

Intel's foundry business bled $7B in 2023 with more to come

It's 2024 and Intel silicon is still haunted by data-spilling Spectre

Intel fuels Huawei's AI PC ambitions with Meteor Lake CPUs in MateBook X Pro

US lawmakers rage over Intel Meteor Lake-powered Huawei PC

SiFive is back with another 64-bit RISC-V dev board – hopefully

Intel courts devs with open arms and exotic hardware

Intel preps export-friendly lower-power Gaudi 3 AI chips for China

About Us

Our Websites

Your Privacy