Implement SSE2 floating-point support in the x86 native code generator (#594)