Hi Mellos,
in my experience, this is normal for the first few iterations, as the cars seem to just be doing ‘random inputs’ and whichever results in the highest score gets reinforced; the cars need to ‘learn’ that ‘forward gets you more points’
Hi Mellos,
in my experience, this is normal for the first few iterations, as the cars seem to just be doing ‘random inputs’ and whichever results in the highest score gets reinforced; the cars need to ‘learn’ that ‘forward gets you more points’