Implementation

This page covers and coordinates implementing SV. The basic concept is to go step-by-step through the overview adding each feature, one at a time. Caveats and notes are included so that other implementors may avoid some common pitfalls.

Links:

Code to convert

There are four projects:

  • TestIssuer (the HDL)
  • ISACaller (the python-based simulator)
  • power-gem5 (a cycle accurate simulator)
  • Microwatt (VHDL)

Each of these needs to have SV augmentation, and the best way to do it is if they are all done at the same time, implementing the same incremental feature.

Critical tasks

These are prerequisite tasks:

People coordinating different tasks. This doesn't mean exclusive work on these areas it just means they are the "coordinator" and lead:

  • Lauri:
  • Jacob: C/C++ header for using SV through inline assembly
  • Cesar: TestIssuer FSM
  • Alain: power-gem5
  • Cole:
  • Luke: ISACaller, python-assembler-generator-class
  • Tobias:
  • Alexandre: binutils-svp64-assembler
  • Paul: microwatt

Adding SV

order: listed in overview

svp64 decoder

An autogenerator containing CSV files is available so that the task of creating decoders is not burdensome. sv_analyse.py creates the CSV files, SVP64RM class picks them up.

  • ISACaller: part done. svp64 detected, PowerDecoder2 in use
  • power-gem5: TODO
  • TestIssuer: part done. svp64 detected, PowerDecoder2 in use.
  • Microwatt: TODO
  • python-based assembler-translator: 40% done (lkcl)
  • c++ macros: underway (jacob)

Links:

SVSTATE SPR needed

This is a peer of MSR but is stored in an SPR. It should be considered part of the state of PC+MSR because SVSTATE is effectively a Sub-PC.

Chosen values, fitting with v3.1B p12 "Sandbox" guidelines:

num name    priv  width
704,SVSTATE,no,no,32
720,SVSRR0,yes,yes,32

Progress:

sv.setvl

a setvl instruction is needed, which also implements sprs i.e. primarily the SVSTATE SPR. the dual-access SPRs for VL and MVL which mirror into the SVSTATE.VL and SVSTATE.MVL fields are not immediately essential to implement.

  • LibreSOC OpenPOWER wiki fields/forms: DONE. pseudocode: TODO
  • ISACaller: TODO
  • power-gem5: TODO
  • TestIssuer: TODO
  • Microwatt: TODO

SVSRR0 for exceptions

SV's SVSTATE context is effectively a Sub-PC. On exceptions the PC is saved into SRR0: it should come as no surprise that SVSTATE must be treated exactly the same. SVSRR0 therefore is added to the list to be saved/restored in exactly the same way and time as SRR0 and SRR1. This is fundamental and absolutely critical to view SVSTATE as a full peer of PC (CIA, NIA).

  • ISACaller: TODO
  • power-gem5: TODO
  • TestIssuer: TODO
  • Microwatt: TODO

Illegal instruction exceptions

Anything not listed as SVP64 extended must raise an illegal exception if prefixed. setvl, branch, mtmsr, mfmsr at the minimum.

  • ISACaller: TODO
  • power-gem5: TODO
  • TestIssuer: TODO
  • Microwatt: TODO

VL for-loop

main SV for-loop, as a FSM, updating SVSTATE.srcstep, using it as the index in the for-loop from 0 to VL-1. Register numbers are incremented by one if marked as vector.

This loop goes in between decode and issue phases. It is as if there were multiple sequential instructions in the instruction stream and the loop must be treated as such. Specifically: all register read and write hazards MUST be respected; the Program Order must be respected even though and especially because this is Sub-PC execution.

This includes any exceptions, hence why SVSTATE exists and why SVSRR0 must be used to store SVSTATE alongside when SRR0 and SRR1 store PC and MSR.

Due to the need for exceptions to occur in the middle, the loop should not be implemented as an actual for-loop, whilst recognising that optimised implementations may do multi-issue element execution as long as Program Order is preserved, just as it would be for the PC.

Remember the following register files need to have for-loops, plus unit tests:

  • GPR
  • SPRs (yes, really: mtspr and mfspr are SV Context-extensible)
  • Condition Registers. see note below
  • FPR (if present)

When Rc=1 is encountered in an SVP64 Context the destination is different (TODO) i.e. not CR0 or CR1. Implicit Rc=1 Condition Registers are still Vectorised but do not have EXTRA2/3 spec adjustments. The only part if the EXTRA2/3 spec that is observed and respected is whether the CR is Vectorised (isvec).

Increasing register file sizes

TODO. INTs, FPs, CRs, these all increase to 128. Welcome To Vector ISAs.

At the same time the Rc=1 CR offsets normslly CR0 and CR1 for fixed and FP scalar may also be adjusted.

Single Predication

TODO

Element width overrides

TODO