Instead of depending on base/cpu.h, copy the GCC/Clang implementation of __cpuid from base/cpu.cc and use ANGLE's original SSE2 detection code, which is built on __cpuid.
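
For reviewers unfamiliar with the approach, here is a minimal sketch of what the change amounts to. It is illustrative only and is not the code in this patch: the names CpuidSketch, SupportsSSE2Sketch and kIsX86_64Sketch are hypothetical, and the fallback for non-x86 targets is an assumption added so the sketch builds everywhere. It covers GCC and Clang only; MSVC provides its own __cpuid intrinsic in <intrin.h>.

```cpp
// Sketch only: hypothetical names, not the actual ANGLE or Chromium code.

#if defined(__i386__) || defined(__x86_64__)

#if defined(__pic__) && defined(__i386__)
// 32-bit PIC builds reserve EBX for the GOT pointer, so save it in EDI
// around the CPUID instruction and read the EBX result back out of EDI.
inline void CpuidSketch(int cpu_info[4], int info_type) {
  __asm__ volatile(
      "mov %%ebx, %%edi\n"
      "cpuid\n"
      "xchg %%edi, %%ebx\n"
      : "=a"(cpu_info[0]), "=D"(cpu_info[1]), "=c"(cpu_info[2]),
        "=d"(cpu_info[3])
      : "a"(info_type), "c"(0));
}
#else
// All other x86 builds can let the compiler see EBX as a normal output.
inline void CpuidSketch(int cpu_info[4], int info_type) {
  __asm__ volatile("cpuid\n"
                   : "=a"(cpu_info[0]), "=b"(cpu_info[1]), "=c"(cpu_info[2]),
                     "=d"(cpu_info[3])
                   : "a"(info_type), "c"(0));
}
#endif

#else
// Assumption for this sketch: on non-x86 targets report "no leaves available".
inline void CpuidSketch(int cpu_info[4], int /*info_type*/) {
  cpu_info[0] = cpu_info[1] = cpu_info[2] = cpu_info[3] = 0;
}
#endif

#if defined(__x86_64__)
constexpr bool kIsX86_64Sketch = true;
#else
constexpr bool kIsX86_64Sketch = false;
#endif

// SSE2 support is reported in bit 26 of EDX for CPUID leaf 1.
inline bool SupportsSSE2Sketch() {
  int info[4];
  CpuidSketch(info, 0);  // Leaf 0: EAX holds the highest supported basic leaf.
  if (info[0] < 1) {
    return false;  // Leaf 1 is not available, so SSE2 cannot be reported.
  }
  CpuidSketch(info, 1);  // Leaf 1: feature flags in ECX and EDX.
  return ((info[3] >> 26) & 1) != 0;
}
```

A caller would check SupportsSSE2Sketch() once and cache the result before choosing an SSE2 code path. On x86-64 the check always succeeds, because SSE2 is part of the baseline instruction set there.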