G5 / G6 / G7
  • Butterfly
    Butterfly
    Maurix
    Posts: 81 from 2004/1/8
    Nice :-) thinking about a dual G5 Machine or maybe
    a G6 or G7 or ???? maybe bee happy with a
    PEG2 working and a upcoming Morphos 1.5

    :-) please
  • »02.05.04 - 18:56
    Profile
  • Moderator
    Kronos
    Posts: 2323 from 2003/2/24
    :angel:
  • »02.05.04 - 19:28
    Profile
  • Butterfly
    Butterfly
    ButterflyKisses
    Posts: 85 from 2004/3/10
    i think it is a little early to be thinking about the G7 as well as the G6 are wayyy far away as far as an actual release...
  • »03.05.04 - 01:11
    Profile
  • Butterfly
    Butterfly
    Posts: 81 from 2004/3/20
    From: USA
    I agree with Maurix, at least a G5. That would put us on the 'cutting edge' so far.

    Harry
    Config: Pegasos II G4 1ghz, 1024mb memory,
    Radeon 9200,160gb hd total, Sound Blaster Live!
  • »12.07.04 - 13:39
    Profile
  • Priest of the Order of the Butterfly
    Priest of the Order of the Butterfly
    Spidey
    Posts: 568 from 2003/2/24
    From: Netherlands
    Honestly speaking, I really want up-to-date software instead of speed increase with MorphOS. But hey, ofcourse a G5 would be nice to have :-)

    Bye,

    Spidey
  • »12.07.04 - 13:46
    Profile Visit Website
  • Priest of the Order of the Butterfly
    Priest of the Order of the Butterfly
    DJBase
    Posts: 745 from 2003/4/6
    From: Germany
    First I would like to see more quality apps for MorphOS not only quick and dirty ports. Then we can talk about faster CPUs. A G5 is a bit useless if there are no apps that could take advantage of it.
    Mac mini, PowerPC G4 1.5 GHz, ATI Radeon 9200 64 MB, 1 GB RAM, 80 GB HDD, MorphOS 3.18
    PowerBook, PowerPC G4 1.67 GHz, ATI Radeon 9700 128 MB, 2 GB RAM, 250 GB mSATA HDD, MorphOS 3.18
  • »12.07.04 - 14:13
    Profile Visit Website
  • Butterfly
    Butterfly
    ultraspec
    Posts: 94 from 2004/1/29
    Pegasos G5: Oracle announces MorphOS support? Just kidding :)
  • »12.07.04 - 15:57
    Profile Visit Website
  • Caterpillar
    Caterpillar
    Posts: 26 from 2003/9/9
    G6 and G7 are not even out yet, so it is not a good thing to base an entire product on that. G5 however would be a very very good idea. I think a G5 product would also stay rather long at the top of the line, or not much below it.

    Why do I think that? Chips speed will not increase as fast as it used to, only parallellising right now (like in the Cell chip or with dual core chips) would be able to gain you some fast advancements. But the gains in most areas of parallellising (except matrix multiplication - like used in 3D and other streaming media) will diminish with increasing number of "processors".

    Basically we see that in the chips industry there is a turning point. Moores law can no longer be maintained, and die shrinks do not gain as much as they used to because of additional complications. Things have to be done differently, and that costs a lot of extra for the companies who produce these chips (or the machines to build them). For example we see that with a transistor shrink from 90 to 60 nm the gates built using silicon dioxides as isolators become susceptive to current tunneling. Since tunneling goes with an exponential function of distance between the gate and the part where current flows, you must make the gate isolator "longer" to reduce tunneling. For this you need a higher k material like siliconoxinitride. However that material is harder to etch selectively with the same feature aspect ratios as silicon dioxide vs Si. Also you can contaminate your factories with some materials. So these workarounds to make die shrinks more feasible all have their drawbacks. We currently see that in the 130 to 90 nm transition, problems are still hampering high clock speeds. This will be even more so with the transition from 90 nm to 60 nm.

    So I hope that a G5 Peg3 will be out soon with MOS 1.5 with a complete C++ compliant gcc version :-)
  • »24.07.04 - 14:24
    Profile
  • Just looking around
    Hammer
    Posts: 15 from 2004/7/24
    Note that P4’s integer units are double pumped i.e. 2X the rated clock speed e.g. P4@ 3.2 Ghz has integer units @6.4 Ghz.

    Reference
    http://chip-architect.com/news/2001_10_07_AMDs_long_term_options.html
  • »25.07.04 - 08:05
    Profile
  • Acolyte of the Butterfly
    Acolyte of the Butterfly
    BigGun
    Posts: 150 from 2004/6/18
    From: Nagold - Germany
    Quote:


    Hammer wrote:
    Note that P4’s integer units are double pumped i.e. 2X the rated clock speed e.g. P4@ 3.2 Ghz has integer units @6.4 Ghz.




    So what?
    Everyone knows this since years.

    But did you know that the engine of my motorbike can do over 10000 revolutions per minute?
    This is many times more revolutions than the engine of ocean ship can do!

    Revolutions != power!
    Otherwise the navi would bould motorcycle engines into their ships.

    Everyone knows that Clockrate != processing-power.

    The Pentium IV was an experiment to build a ship tuned for very high clockrates.
    Because many people thing high clock rates equal high performance those chips sell good.
    Mostly all CPUs have more processing power clock per clock than the Pentium IV.
    Even Intel admitted that the race for higclockrates is a deadend.
    Intel will focus again on chips like the Pentium M which isn't double pumped and doesn't
    reach the high clockrates of the Pentium IV but still outperforms much higher clocked P IVs.

    We should not wait for Pegasos with G5,G6, ..or G99
    The current Pegasos II with G4 is already an excellent computer.

    The new 7447 gives excellent perfomance and together with the high speed and low footprint of MOS
    the Pegasos II is a superp machine. And Genesi could easily produce today with no changes a Pegasos II with 1.5 Ghz G4. A 1.5 G4 with MOS will rock.

    On my opinion the very best thing for Genesi is on focusing to produce Pegasos II with G4.
    Freescale (ex Motorola) does not sleep. While we wait for MOS 1.5 - freescale is improving the G4.
    Soon there will be faster G4 available. The G4 still has room for improvements. Give freescale
    a few month time, let them produce G4 in 90nm instead of 130nm and you will get G4 with
    2.0 Ghz clockrate which might even run passivly. Those G4 will make very nice CPUs for current Pegasos.


    Cheers
    Gunnar
  • »25.07.04 - 10:47
    Profile Visit Website
  • Order of the Butterfly
    Order of the Butterfly
    dholm
    Posts: 296 from 2003/9/1
    From: Malmo, Sweden
    @Hammer:
    Did you know that the IA32 architecture has 12 general purpose registers whereas the PPC32 has 32?
    Did you know that 3/4th of the instructions in an average IA32 application accesses the memory due to the low number of registers? (Which is why you need really fast memory on a PC)
    Did you know that SSE specifies 8 128-bit registers whereas AltiVec has 32?
    Did you know that the IA32 was designed in the 1970's and that all processors based on it (including the P4) are backwards compatible with the first IA32 design?
    Did you know the PowerPC was designed in the late 1980's, early 1990's?

    This list can be made very long. Intel is pushing the clockrate further and further each year and is not much concerned with the relatively outdated CPU design. The reason for this is that most people don't know anything about processor design, they just assume that clockrate is everything, and therefore, clockrate sells.
    This does not apply to the IA64 which is based on a much more modern design (VLIW). This CPU is not target at the desktop market though.
  • »25.07.04 - 11:06
    Profile Visit Website
  • Caterpillar
    Caterpillar
    Posts: 26 from 2003/9/9
    There are some interesting articles on the PPC970
    here and here.
    One interesting quote from one of those two articles is the following...
    Quote:

    And if you consider the fact that the 970's power consumption at 1.2GHz is a mere 19W

    And that is for a 130 nm version, the 90 nm would be even better. So a downclocked G5 would not be such a bad idea.


    [ Edited by azalin on 2004/7/25 14:06 ]
  • »25.07.04 - 11:47
    Profile
  • Just looking around
    Hammer
    Posts: 15 from 2004/7/24
    @dholm

    My post was about the real clock speed within Pentium IV in relation to integer units. In another words Netburst architectures (“Rapid Execution Engine”™) needs those tricks to remain competitive with wide approach pipelined processors. Such tricks has yet to be applied to other processors.

    Quote:

    Did you know that the IA32 architecture has 12 general purpose registers whereas the PPC32 has 32?

    IA32 ISA only has 8 GPRs, 8 X87* and 8 XMM. While X86-64 has 16 GPRs, 8 X87* and 16 XMM.

    *With AMD arch
    _ K7-XP recycles X87 registers for 3DNow (SIMD). While "3DNow Pro" is just relabel of SSE (which uses XMM regs). A total of 16 SIMD registers (8 X87/3Dnow/MMX + 8 XMM).
    _K8's a total of 24 SIMD registers (8 X87/3Dnow/MMX + 16 XMM).

    On micro-architecture level, AMD’s K7 and K8 also has 8 additional registers for microcode engine. This is hidden from the programmer and exclusively use by micro-code engine.

    More registers are in register wheel for micro-architecture level improvements (i.e. reducing load/store memory access for storing register data closer to the processor).

    Quote:

    Did you know that 3/4th of the instructions in an average IA32 application accesses the memory due to the low number of registers? (Which is why you need really fast memory on a PC)


    Refer to K7 Athlon XP-M(with Bartons core) and VIA KT133A (MSI-6330) reasonable performance levels.

    Requirements for massive bandwidth is for Pentium IV(Netburst).

    Quote:

    Did you know that the IA32 was designed in the 1970's and that all processors based on it (including the P4) are backwards compatible with the first IA32 design?

    Note that modern X86 processors (since P6 and K5) doesn't directly execute X86-32 instructions i.e. they convert them into fix length RISC like instructions i.e. taking one of basic concepts of RISC. Such design doesn't exist in pre-P6/K5 X86 architecture.

    Please review RISC concepts and Post-RISC architectures in X86.

    Quote:

    Did you know the PowerPC was designed in the late 1980's, early 1990's?

    Note that same argument can be made from VILW view point against Pure-RISC, FRISC and Post-RISC.

    Quote:

    This list can be made very long. Intel is pushing the clockrate further and further each year and is not much concerned with the relatively outdated CPU design.


    That’s a stupid statement since you haven’t factored in ISA architecture and micro-architecture.

    The 8086 executes instructions at variable length while P4 executes fix length instructions (one of the basic RISC concepts)(after decoding) and register wheel and extra micro-coding registers increases on-chip registers (one of the basic RISC concepts). The 8086 micro-architecture is long dead.

    Note that; the custodian of X86 ISA is now with AMD’s hands since Intel has practically being a follower in X86’s next evolution.

    Please try again…

    [ Edited by Hammer on 2004/7/25 14:26 ]

    [ Edited by Hammer on 2004/7/25 14:29 ]
  • »25.07.04 - 12:49
    Profile
  • Just looking around
    Hammer
    Posts: 15 from 2004/7/24
    Quote:

    So what?
    Everyone knows this since years.


    The purpose of my post was for highlighting the true high clock speeds in P4 (netburst) architecture. This is due to Pentium IV’s being narrow approach. This trick was use to compete with wide approach processors.

    Quote:

    Revolutions != power!

    It’s IPC x clock speed. Pentium IV is extremely bias towards clock speed in expense of wide approach (related to IPC).

    Quote:

    Even Intel admitted that the race for higclockrates is a deadend.


    This is due to cooling/power issues and post-Prescott successor’s dead end.
    Using their +6Ghz fabrication processes now is eating their future reserve capabilities.
  • »25.07.04 - 13:08
    Profile
  • Caterpillar
    Caterpillar
    Posts: 26 from 2003/9/9
    @dholm
    Read this article.
    Note that the story is not about the P4 but of the architecture of P3 and below (so not about netburst).
    Quote:

    Modern x86 architectures have workarounds, like rename registers and a "free" FXCH instruction, for alleviating — but not eliminating — the performance disadvantages of x87's register-starved (only eight architectural registers) and stack-based architecture. The Pentium, however, had none of these, so it suffered mightily compared to its RISC competitors.

    Quote:


    So to summarize, the Pentium's entire front-end was bloated and distended with hardware that was there solely to support x86 (mis)features which were rapidly falling out of use. With transistor budgets as tight as they were, each of those extra address adders and prefetch buffers — not to mention the microcode ROM — represented a painful expenditure of scarce resources that did nothing to enhance performance.
    ...
    Today, x86 support accounts for well under 10% of the transistors on the Pentium 4 — a drastic improvement over the original Pentium, and one that has contributed significantly to the ability of x86 hardware to catch up to and even surpass their RISC competitors in both integer and floating-point performance. In other words, Moore's Curves have been extremely kind to the x86 ISA.
    ...
    Register renaming allows a processor to have a larger number of actual registers than the ISA specifies, thereby enabling the chip to do more computations simultaneously without running out of registers. Of course, there's some sleight-of-hand involved in fooling the program into thinking that it's using only eight registers, when it's really using up to 40


    The PPC uses register renaming as well by the way. The pentium M is based on the above described core.
  • »25.07.04 - 13:12
    Profile
  • Just looking around
    Hammer
    Posts: 15 from 2004/7/24
    Quote:

    @Read this article.

    Note that AMD's Athlon converts X87 FPU stack model into absolute FPU model i.e. AMD advances Intel’s post-RISC X86 designs to cover X87's weakness.

    [ Edited by Hammer on 2004/7/25 14:21 ]
  • »25.07.04 - 13:21
    Profile
  • Just looking around
    Hammer
    Posts: 15 from 2004/7/24
    Quote:

    Intel will focus again on chips like the Pentium M which isn't double pumped and doesn't
    reach the high clockrates of the Pentium IV but still outperforms much higher clocked P IVs.

    Depends on applications; refer to
    http://www.anandtech.com/mb/showdoc.aspx?i=2128
    (AMD's Socket 939 vs Intel's Socket 775)

    http://www.anandtech.com/cpuchipsets/showdoc.aspx?i=2129
    (Dothan investigated)

    [ Edited by Hammer on 2004/7/25 14:37 ]
  • »25.07.04 - 13:35
    Profile
  • Caterpillar
    Caterpillar
    Posts: 26 from 2003/9/9
    Those are some interesting links, thank you. By the way one of the reasons why I like the PPC970 (dubbed G5 by Apple) so much is the following:
    Quote:

    The 970's floating-point implementation is almost exactly like the G4e's, except there's twice as much hardware. The 970 has two identical FPUs, each of which can execute the fastest floating-point instructions (like the FMADD) in 6 cycles.

    Quote:

    And finally, the 970 has a larger number of FPRs: 80 total microarchitectural registers, where 32 are PowerPC architectural registers and the remaining 48 are rename registers.

    Quote:

    Furthermore, the 970's fast frontside bus (effectively 1/2 the core clock speed) and high-bandwidth memory subsystem (probably dual-channel DDR400) will give it a distinct advantage over the G4e
  • »25.07.04 - 13:59
    Profile
  • Acolyte of the Butterfly
    Acolyte of the Butterfly
    BigGun
    Posts: 150 from 2004/6/18
    From: Nagold - Germany
    Hi Hammer,

    Thanks for explaining your post.
    I think I misunderstood it. :-)


    It looks like we all agree that the PPC architecture is not too bad.


    Cheers
    Gunnar

    [ Edited by BigGun on 2004/7/25 22:33 ]
  • »25.07.04 - 20:32
    Profile Visit Website
  • Just looking around
    Hammer
    Posts: 15 from 2004/7/24
    I don’t recall applying any negative remarks on PPC MPUs. Available motherboards for purchase are another issue altogether.
  • »25.07.04 - 22:13
    Profile
  • Order of the Butterfly
    Order of the Butterfly
    minator
    Posts: 370 from 2003/3/28
    Quote:

    Note that P4’s integer units are double pumped i.e. 2X the rated clock speed e.g. P4@ 3.2 Ghz has integer units @6.4 Ghz.


    But...
    They can only work at that speed for 16 bit values, they did develop 32 bit versions at that speed but I don't think they've ever put them into production.

    In any case I think they were planning to remove the double clocked stuff.
  • »25.07.04 - 22:53
    Profile Visit Website
  • Acolyte of the Butterfly
    Acolyte of the Butterfly
    DethKnight
    Posts: 139 from 2003/6/24
    From: Central USA
    I wish they woould hurry up and meake an SMT version of this 970 for consumers




    edit
    ---------------------------------------------------------------------------------------

    well , I guess it pays to read sometimes......grrr

    http://www.theregister.co.uk/2004/07/26/ibm_ppc970mp/

    (seems motorola/freescale is doing it also)










    ----------------------------------
    The champ takes 6 in the Tour de Lance

    [ Edited by DethKnight on 2004/7/27 0:32 ]

    [ Edited by DethKnight on 2004/7/27 0:33 ]
    I am ; therefore you are
  • »26.07.04 - 02:32
    Profile
  • Caterpillar
    Caterpillar
    Posts: 26 from 2003/9/9
    There is a new article on Ars about netburst (P4).
    http://arstechnica.com/cpu/004/pentium-2/pentium-2-1.html

    Quote:

    The Pentium 4's designers took the P6's 12-stage pipeline and sliced it up into finer increments. Each stage does much less work, but this allows the processor to run faster...
    Let's say that each stage of a 20-stage processor does half the amount of work per clock cycle as each stage of a 10-stage processor. So the 20-stage processor takes two clock cycles to do what the 10-stage processor does in one. This means that the 20-stage processor has to run twice as fast as the 10-stage processor if it wants to do the same amount of work in the same amount of time. Why would you do things this way? Well, if people want to buy processors with higher clock speeds, then why not?
    Intel's reasoning went something like:
    1 process improvements
    2 clock speed increases
    3 profit!!!!



    From one of the articles about the PPC970 it seems the vector unit was "tacked on" so it is not as efficient as the one of the G4 per clock - however if your compiler schedules the instructions better it is as efficient, and because of the higher clock speeds this will speed it up.

    What we really need in the future is a PPC core with an XDR ram interface ( http://www.rambus.com/products/xdr/arch_overview.cfm ) and added vector units. An order of magnitude increase in RAM speed would mean processors will not be stalled as much because they are waiting on cache fetches.
  • »26.07.04 - 09:20
    Profile