*** lx0 is now known as lxo | 10:04 | |
lkcl | twin +/- FMAC is working, that's phase 2 of the DCT/DFT augmentations | 19:12 |
---|---|---|
programmerjake | yay! | 22:28 |
lkcl | it's quite fascinating, how many uses FFT/DCT/DFT has, and really quite scary the amount of code and assembler needed | 23:14 |
lkcl | the wikipedia page for Cooley-Tukey does warn though that solving the bit-reverse LOAD isn't the full picture | 23:14 |
lkcl | there's also the fact that the L1/L2 caches get hit by "distributed" data | 23:15 |
lkcl | and of course, it's all on power-of-two boundaries | 23:15 |
lkcl | which means that there's the risk of hitting the exact same cache line for the array data | 23:16 |
lkcl | hilariously, finding efficient simple *scalar* algorithms is challenging | 23:17 |
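
Editor's note: the two points made above (the bit-reversed load order used by Cooley-Tukey, and power-of-two strides landing on the same place in the cache) can be sketched in a few lines of Python. This is only an illustration, not code from the discussion. The helper names and the cache geometry (64-byte lines, 64 sets) are assumptions chosen for the example, and strictly speaking the collision is on the same cache *set*, which is the same line slot only in a direct-mapped cache.

```python
def bit_reverse(i: int, bits: int) -> int:
    """Reverse the low `bits` bits of i (hypothetical helper)."""
    r = 0
    for _ in range(bits):
        r = (r << 1) | (i & 1)  # shift the lowest bit of i into r
        i >>= 1
    return r


def bit_reversed_order(n: int) -> list[int]:
    """Index order for a bit-reversed load of n elements, n a power of two."""
    bits = n.bit_length() - 1
    if n <= 0 or (1 << bits) != n:
        raise ValueError("n must be a power of two")
    return [bit_reverse(i, bits) for i in range(n)]


def cache_set(addr: int, line_bytes: int = 64, num_sets: int = 64) -> int:
    """Set index for a byte address, under the ASSUMED cache geometry."""
    return (addr // line_bytes) % num_sets


def sets_touched(stride_bytes: int, count: int) -> set[int]:
    """Distinct cache sets hit by `count` accesses at a fixed byte stride."""
    return {cache_set(k * stride_bytes) for k in range(count)}


if __name__ == "__main__":
    print(bit_reversed_order(8))
    # Power-of-two stride: every access maps to one set.
    print(len(sets_touched(4096, 16)))
    # Stride padded by one cache line: accesses spread across sets.
    print(len(sets_touched(4096 + 64, 16)))
```

With these assumed parameters, sixteen accesses at a 4096-byte stride all compete for a single set, while padding the stride by one cache line spreads them over sixteen sets. Padding of this kind is a common workaround for the conflict, not something proposed in the log itself.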