Alexey's branch (the first link) is in active development, which might be why it's running slower. The official NVIDIA branch (the second link) is the one I have integrated into my branch. I have yet to test its performance, but it might be worth trying the official NVIDIA branch to see whether it does in fact perform differently from Alexey's.