Meta Processing

MIPS32 1074K Coherent Processing System


Home entertainment consumer products are rapidly changing, driven by new ways in which consumers use and interact with these devices. Web connectivity is a technology migrating into products in your digital living room, extending from traditional access using a PC or laptop computer. Your flat panel TV is now becoming a portal to a much wider variety of web-based media and content, as well as enabling services such as video chat sessions with friends and family.

These are compelling new use models revolutionizing traditional products in the digital living room, and they require new levels of performance in the underlying platforms they run on. To address these top end performance challenges, the new MIPS32® 1074K™ Coherent Processing System (CPS) is the latest coherent multiprocessor IP offering from Imagination. The 1074K CPS sets a new standard of performance within MIPS' portfolio of licensable processor IP, delivering over 15,000 CoreMark and up to 12,000 DMIPS in 40nm technology at 1.5 GHz (worst case slow corner conditions with production margins). The 1074K CPS is available as fully-synthesizable multicore IP that achieves high frequencies using commercially-available, non-custom, standard cells and memories. The 1074K CPS enables easy implementation of very high-performance multicore systems, maximum flexibility in configuration for specific features, and the ability to migrate your design across foundries, process nodes, and geometries.

The 1074K CPS is based on the combination of two high-performance technologies—coherent multiprocessing, and the superscalar, Out-of-Order (OoO) MIPS32 74K® processor core as the base CPU. The 74K is a multi-issue, 15-stage OoO architecture already in production with numerous customers for digital televisions, set-top boxes and a variety of home networking applications. It is broadly used in internet-connected digital home products. To further optimize the core for this growing trend, Imagination recently enhanced the 74K core to deliver as much as 30% additional performance on JavaScript and Web applications. With the 74K core at its heart, the 1074K CPS is an ideal high-performance platform for today's SoC designs, with headroom for tomorrow's designs as well.

MIPS32 1074K Summary

  • Foundry process independent, fully-synthesizable, multiprocessor IP supporting very high frequency implementations with commercial, non-custom standard cells and memories
  • Cache coherent multiprocessing system supporting configurations with up to 4 cores
  • Uses enhanced version of 15-stage superscalar, OoO 74K series core as base CPU, making it an ideal high-performance multiprocessing platform for Web-connected consumer products and highly-integrated home/residential gateways
    • Provides additional 20-30% per core performance on JavaScript and Web-connected applications relative to earlier releases of 74K core RTL
  • Supported by a broad ecosystem of application software and operating systems

MIPS32 1074K Features

A complete system for coherent multiprocessing, including:

  • 1 to 4 1074K "base" cores
    • 1074K base core = 74K superscalar, out-of-order high-performance processor with 15-stage pipeline, and cache coherence structures added for interface to Coherence Management unit
  • Coherence Management (CM) unit – high throughput coherence fabric supporting 256-bit wide buses internally on key datapaths, as well as external read and write data interfaces to L2 cash and through to rest of system logic in SoC implementation
  • I/O Coherence Unit (IOCU) – hardware acceleration for I/O coherence, offloading software implementation on CPUs
  • Cluster Power Controller (CPC) – multicore power gating, clock gating, and reset management
  • Global Interrupt Controller (GIC) – system and inter-processor interrupt controller
  • EJTAG/PDtrace™ block for advanced debug/trace of complete coherent system

Floating Point Unit (FPU)

  • MIPS 32® 1074Kf™ core version has IEEE 754-compliant FPU per core, compliant to MIPS® 64-bit FPU architecture
  • Supports single- and double-precision data types
  • Separate in-order, dual-issue pipeline decoupled from integer pipeline in each core

Coherency Management (CM) Unit

  • Manages coherency using the MESI protocol
  • Maximized throughput for multicore cluster supported by:
    • 256-bit wide read and write buses (4x bus width between each core and CM) on key internal datapaths
    • 256-bit wide read and write interfaces from CM to L2 cache and through to system interconnect
    • L1 cache-to-cache transfers, speculative reads to external memory, and globalized cache operations
  • Global Configuration Registers (GCRs) for configuring/controlling CM scheme

Cluster Power Controller (CPC)

  • Provides highly scalable performance/power management via shutdown and bring up of one or more cores in the coherent processing system via clock or voltage island control
  • Works in conjunction with each core implemented in a separate power domain

I/O Coherence Unit (IOCU)

  • Bridges non-coherent I/O peripheral transfer and makes transactions coherent
  • Supports per-transaction attributes for snooping L1 caches, L1+L2 caches, or non-coherent transactions, plus I/O prioritization

Global Interrupt Controller (GIC)

  • Supports system-level interrupts; inter-processor interrupts
  • Routes interrupts to a particular core or VPE
  • Configurable # of system interrupts (up to 256)

Development Tools

  • MIPSR Navigator ICS – Integrated Development Environment (IDE), software toolkit, cycle accurate and instruction accurate simulator models, EJTAG and PDtrace probes
  • CodeSourcery - SG++ toolchains for MIPS

MIPS32 1074K Specifications

Base Core: 1074Kf (with FPU) 1074Kf (with FPU)
Configuration: dual coredual core
Process: 40nm G (TSMC) 40nm G (TSMC)
Libraries: TSMC 12 track, SVt TSMC 12 track, MVt/OD
Frequency: > 1.2 GHz 1.5 GHz
Performance  
 Coremark/MHz (per core)2.552.55
 DMIPS/MHz (per core)2.032.03
Power: 0.36 mW/MHz 0.43 mW/MHz
Area: 4.1 mm2 4.1 mm2

Both implementations optimized for speed (area and power-optimized specifications available upon request).

Achieved using free standard cells from TSMC and memories from Dolphin; quoted speeds are at worst case slow/slow corner, with signal integrity and production margins of 10% OCV and 25 ps clock jitter.

Note:
Frequency, power consumption and size depend upon configuration options, synthesis, silicon vendor and process and cell libraries.

Dual core configuration: 2 cores + coherence management unit with hardware IO coherence. Each core configured with 32K/32K caches, 32 entry dual TLB, and hardware floating point unit.



Cookies on the
Imagination website
We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we'll assume that you are happy to receive all cookies on the Imagination website. However, if you would like to, you can change your cookie settings at any time.