A software core for the C64 it's possible. Unlike other implementations based on programmable logic (FPGA) and soft-cores, this is a 32-bit microcontroller running a 6510 emulator. So I call it software-core, no soft-core.
For easy reconfigurability, a portable C programed microprocessor emulator is used resulting in a "High-Level Languaje In-Circuit Emulator".
This is a spacetime emulator use, space because in-circuit, and time because real-time operation (software running synchronized to an extrernal clock).
Watch the videos about a C64 with the 6510 microprocessor replaced with a 32-bit microcontroller: ARM Cortex M3 LPC1769. There are running games and IEC operation (via Uno2IEC).
As you can see, emulation it's not limited to 6510 processor only. Why not implement another CPU emulators for an heterogeneous multi-core C64. In this next post view 6510 and Z80 emulators on ARM for cartridgeless C64 CP/M: Dual Core C64
miker00lz's EhBasic@Arduino |
Inspiration:
On Sept 2015 I saw miker00lz’s post at Arduino forum about runnig his 6502 emulator (fake6502) on Arduino (https://forum.arduino.cc/index.php?topic=193216.0).
That was interesting but I thought it would be better if the microcontroller handle external SRAM, CIA or SID.
EhBasic@LPC1769 debug console |
ARM:
Thinking that at some point Arduino would limit performance I choose a more powerfull platform: an NXP LPC1769, ARM Cortex 3. The LPC also has 5V tolerant GPIO pins needed by MOS chips.
First test was with a 32k x 8 SRAM (HM62256) and fake6502 running EhBasic.
//MEMORY READ
uint8_t read6502(uint16_t address){
FIO2SET |= (1 << 10); // Pone en H RW* (p2.10) Lectura
FIO2PINH= address << 2; // Escribe la dirección A0 a A13 en P1.18 a P1.31 (bus de direcciones)
FIO2SET |= (1 << 11); // Pone en H CS (p2.11)
value = FIO2PIN0; // Lee bus de datos (P2.0 a P2.7)
FIO2CLR |= (1 << 11); // Pone en L CS (p2.11)
RETURN (value);
}
//MEMORY WRITE
void write6502(uint16_t address, uint8_t value){
FIO2PINH= address << 2; // Escribe la dirección A0 a A13 en P1.18 a P1.31 (bus de direcciones)
FIO2PIN0= value; // Escribe bus de datos (P2.0 a P2.7)
FIO2CLR |= (1 << 10); // Pone en L RW* (p2.10) Escritura
FIO2SET |= (1 << 11); // Pone en H CS (p2.11)
FIO2CLR |= (1 << 11); // Pone en L CS (p2.11)
}
The pins connection was direct and the emulator ran asynchronously, EhBasic code was, like on the Arduino, contained as a C constant array in the microcontroller flash program memory.
Then I wanted to reproduce 6502 operation reading program from a ROM, that is an external stored program, external to the microcontroller. One first option was to write EhBasic to a 29FXXX DIP Flash but I prefered to use C64 ROMs, so I retired from a C64 a socketed 901225 Characters ROM chip, connected it like the SRAM to the microcontroller and readed it.
And then, one more step, what about connecting the microcontroller to the C64 replacing the original 6510 and test ROM, CIAs and SID with address decoding provided by the original PLA.
It seems trivial but presents several difficulties because in the C64 the 6510 microprocessor shares the buses with the VIC video chip, the system RAM is DRAM refreshed by the VIC chip and we need to emulate 6510’s I/O port.
6510’s I/O port is mapped at $0001 address and, in C64, bits 0,1 and 2 connected to LORAM, HIRAM and CHAREN signals. PLA use this signals for switching between ROM and RAM for $A000-$BFFF, $D000-$DFFF and $E000-$FFFF memory areas.
This operation was implemented with 3 GPIO pins and catchs at write function:
// 6510 I/O Port
if (address==0x1) {
if ((value & 0x1)==0){
FIO2CLR = (1 << 11); // Pone en L LORAM
}
else
{
FIO2SET = (1 << 11); // Pone en H LORAM
}
if ((value & 0x2)==0){
FIO2CLR = (1 << 12); // Pone en L HIRAM
}
else
{
FIO2SET = (1 << 12); // Pone en H HIRAM
}
if ((value & 0x4)==0){
FIO2CLR = (1 << 13);
}
else
{
FIO2SET = (1 << 13); // Pone en H CHAREN
}
}
VIC is out |
VIC:
As VIC provides CLK for CPU up to here the emulator runs asynchronously. In order to place the VIC on the board and share buses the emulator needs third state capability and synchronization to system clock.
Interface:
That operation was implemented using a custom interface adapter using discrete logic: 3 x 74HC245 Octal 3−State Noninverting Bus Transceivers for address and data bus and a 74HC00 for R/W. Two remaining 74HC00 gates were used to accommodate required delay between in and out 6510 clock signals.
Also read and write functions were rewritten in order to take into account AEC and RDY states.
As 6510 only can access buses when AEC=HIGH all read and write operations are synchronized to that signal and not to Clk (Fi2).
Timmings:
As VIC provides CLK for CPU up to here the emulator runs asynchronously. In order to place the VIC on the board and share buses the emulator needs third state capability and synchronization to system clock.
In the C64 bus access is driven by VIC with it’s BA signal connected to 6510’s AEC pin. All microprocessor read and write operations take place when AEC=1 and must be enabled by the RDY signal too. When AEC=0 VIC uses the buses and microprocessor pins must go to third state.
For more datails please see: http://www.zimmers.net/cbmpics/cbm/c64/vic-ii.txt
For more datails please see: http://www.zimmers.net/cbmpics/cbm/c64/vic-ii.txt
Interface:
That operation was implemented using a custom interface adapter using discrete logic: 3 x 74HC245 Octal 3−State Noninverting Bus Transceivers for address and data bus and a 74HC00 for R/W. Two remaining 74HC00 gates were used to accommodate required delay between in and out 6510 clock signals.
As 6510 only can access buses when AEC=HIGH all read and write operations are synchronized to that signal and not to Clk (Fi2).
//MEMORY READ (SHARED BUS)
uint8_t externalread(uint16_t address) {
uint8_t value;
while ((FIO2PIN1&1)==0) {}; // Espera mientras AEC=0 -> Fi0(P2.8)=0
while ((FIO2PIN1&1)==1) {}; // Espera mientras AEC=1 -> Fi0(P2.8)=1
FIO1PINH =(address << 2); // Escribe la dirección A0 a A13 en P1.18 a P1.31 (bus de direcciones)
FIO4PINH= (address >> 2); // A14 y A15 en P4.28 y P4.29
while ((FIO0PIN3&16)==0){}; // Si RDY=0 (P0.28) espero aquí
while ((FIO2PIN1&1)==0) {}; // Espera mientras AEC=0 -> Fi0(P2.8)=0
while ((FIO2PIN1&1)==1) {}; // Espera mientras AEC=1 -> Fi0(P2.8)=1
value = FIO2PIN0; // Lee bus de datos (P2.0 a P2.7)
// DEBUGOUT ("<> %x %x\n",address,value);
return(value);
}
//MEMORY WRITE (SHARED BUS)
void externalwrite(uint16_t address, uint8_t value) {
while ((FIO2PIN1&1)==0) {}; // Espera mientras AEC=0 -> Fi0(P2.8)=0
FIO2PIN0 = value; // Escribe bus de datos (P2.0 a P2.7)
while ((FIO2PIN1&1)==1) {}; // Espera mientras AEC=1 -> Fi0(P2.8)=1
////////////////////FLANCO DESCENDENTE////////////////////
FIO1PINH =(address << 2); // Escribe la dirección A0 a A13 en P1.18 a P1.31 (bus de direcciones)
FIO4PINH= (address >> 2); // A14 y A15 en P4.28 y P4.29
FIO2SET = (1 << 10); // Pone en H W (P2.10) Escritura
FIO2DIR0 = 0xff; // Bus de datos como Salida
while ((FIO2PIN1&1)==0) {}; // Espera mientras AEC=0 -> Fi0(P2.8)=0
while ((FIO2PIN1&1)==1) {}; // Espera mientras AEC=1 -> Fi0(P2.8)=1
////////////////////FLANCO DESCENDENTE////////////////////
FIO2PIN0 = value; // Escribe bus de datos (P2.0 a P2.7)
FIO2DIR0 = 0x00; // Bus de datos como Entrada
FIO2CLR = (1 << 10); // Pone en L W (P2.10) Lectura
}
Timmings:
VIC chip was connected to the board and several time adjustments were made to read/write routines.
Incorrect timmings produced DRAM corruption during VIC refresh and access.
Incorrect timmings produced DRAM corruption during VIC refresh and access.
Ram Corruption |
Error: 38911./909 Bytes Free |
Finally I got the system running, a C64 with a software-emulated microprocessor!
Testing;
The shared bus access works fine even with the lower performance obtained, about 60%, respect to the original 1MHz 6510.
Of course better performance can be achieved using an assembly emulator, like a6502 (https://github.com/BigEd/a6502), but the idea is a "High-Level Languaje In-Circuit Emulator".
More photos
More photos
Utterly brilliant, really well done.
ResponderEliminarDon't suppose your releasing the project at all are you?
What do you mean by "catches"? Caches? Latch? Level shifter?
ResponderEliminarhi telmomoya
ResponderEliminarnice project -but only 60% cpu performance power?
we will wait for a fast mcu like a cheap and fast pi pico from you