PrimeDrift

Corrected evaluation of GPT-4 performance on primality testing.

To run the evaluation, set your OpenAI API key and invoke the script:

    OPENAI_API_KEY=XXXX python evaluate.py
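
For readers unfamiliar with how a primality benchmark can be scored, here is a minimal sketch of the general idea: compute ground truth with a deterministic check and compare it against the model's yes/no answers. This is not the repository's `evaluate.py`. The names `is_prime` and `accuracy` are hypothetical, and the sketch assumes the model's answers have already been parsed into booleans.

```python
def is_prime(n: int) -> bool:
    """Deterministic trial division; adequate for small benchmark inputs."""
    if n < 2:
        return False
    if n < 4:
        return True  # 2 and 3 are prime
    if n % 2 == 0:
        return False
    i = 3
    # Only odd divisors up to sqrt(n) need to be checked.
    while i * i <= n:
        if n % i == 0:
            return False
        i += 2
    return True


def accuracy(numbers, answers):
    """Fraction of boolean answers that agree with the ground truth.

    `answers[k]` is assumed to be True when the model claimed
    `numbers[k]` is prime (hypothetical input format).
    """
    if not numbers:
        return 0.0
    correct = sum(is_prime(n) == a for n, a in zip(numbers, answers))
    return correct / len(numbers)
```

For example, `accuracy([2, 4, 7, 9], [True, False, True, True])` scores three of four answers as correct, since 9 is composite.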