From d5fd5c7ad38860a1668164e4f4a00e6db54d8bb1 Mon Sep 17 00:00:00 2001
From: Thanassis Tsiodras
Date: Sun, 10 Jul 2022 18:40:02 +0200
Subject: [PATCH] Mention the branch with the AVX work.

---
 README    | 14 ++++++++------
 README.md | 12 ++++++++----
 2 files changed, 16 insertions(+), 10 deletions(-)

diff --git a/README b/README
index aea1c78..8a1573c 100644
--- a/README
+++ b/README
@@ -27,8 +27,6 @@ Make sure you have libSDL installed - then...
     $ ./configure
     $ make
 
-You can then simply...
-
     $ src/mandelSSE -h
 
     Usage: ./src/mandelSSE [-a] [-s|-x] [-h] [-f rate] [WIDTH HEIGHT]
@@ -96,10 +94,14 @@ Over the last two decades, I kept coming back to this, enhancing it.
   algorithm [inside a 1.4$ microcontroller](
   https://www.youtube.com/watch?v=5875JOnFDLg).
 
-- And finally (?), in October 2020, I implemented what I understood to be
-  the XaoS algorithm - that is, re-using pixels from the previous frame
-  to optimally update the next one. Especially in deep-dives and large
-  windows, this delivers amazing speedups.
+- In October 2020, I implemented what I understood to be the XaoS algorithm;
+  that is, re-using pixels from the previous frame to optimally update
+  the next one. Especially in deep-dives and large windows, this delivered
+  amazing speedups.
+
+- In July 2022, I optimised further with AVX instructions (+80% speed
+  in CoreLoopDouble). The code exists in a separate branch
+  (https://github.com/ttsiodras/MandelbrotSSE/tree/AVX).
 
 CODERS ONLY
 ===========
diff --git a/README.md b/README.md
index bb00812..8a1573c 100644
--- a/README.md
+++ b/README.md
@@ -94,10 +94,14 @@ Over the last two decades, I kept coming back to this, enhancing it.
   algorithm [inside a 1.4$ microcontroller](
   https://www.youtube.com/watch?v=5875JOnFDLg).
 
-- And finally (?), in October 2020, I implemented what I understood to be
-  the XaoS algorithm - that is, re-using pixels from the previous frame
-  to optimally update the next one. Especially in deep-dives and large
-  windows, this delivers amazing speedups.
+- In October 2020, I implemented what I understood to be the XaoS algorithm;
+  that is, re-using pixels from the previous frame to optimally update
+  the next one. Especially in deep-dives and large windows, this delivered
+  amazing speedups.
+
+- In July 2022, I optimised further with AVX instructions (+80% speed
+  in CoreLoopDouble). The code exists in a separate branch
+  (https://github.com/ttsiodras/MandelbrotSSE/tree/AVX).
 
 CODERS ONLY
 ===========