Data Engineer - Software

Shanghai, Shanghai
03 Oct 2016
23 Nov 2016
Oil and Gas
Contract Type
Full Time
Database Engineer is to implement data warehousing solutions to support cloud-based software development efforts in GE Renewable Energy. The successful candidate will own the technical solutions needed for data ingestion, storage, access and provisioning to enable large scale software and analytics deployment for our customers. Data Engineer will work closely with IT, software product/project managers, software engineers, and GE Digital's Predix teams to deliver successful outcomes.

Essential Responsibilities

In this role you will:
• Develop practical and innovative ways to implement data warehousing solutions, including data ingestion & cleaning, data organization & storage and data access to support scalable cloud-based software development and deployment
• Deliver solutions in a modern "big data" environment of relational and time series databases to support multiple classes of synchronous and asynchronous real-time data
• Effectively manage data velocity, volume and variety in a data intensive infrastructure business
• Track installations, provide regular application updates and own the overall data lifecycle for the database environments
• Collaborate with customers to integrate their data with GE infrastructure
• Work with in data scientists, IT experts, software engineers and data support personnel to provision and monitor solutions
• Support software development efforts by architecting and implementing data solutions that interface with modern analytics environments
• Support data mining, analysis and data presentation efforts for customer-facing software development
• Work with internal customers to define data dictionaries
• Introduce team to new data technologies and look for opportunities for performance improvements
• Work on cross-function teams of software engineers, data scientists, product and project managers and IT experts to deliver leading edge software products to the Wind industry
• Leverage global team's expertise and develop specific data purge and analytical solutions for the improvement of China OEM turbines' performance and asset management; expand GE's digital solutions to offer China customers one-stop solution for their fleet assets.


• Bachelor's Degree in Computer Science or in "STEM" Majors (Science, Technology, Engineering and Math) from an accredited college or university
• Minimum 3 years of experience with relational and time series databases
• Demonstrated ability to develop software with a heavy focus on data manipulation

Desired Characteristics

• Master's degree in Engineering or Computer Science
• Demonstrated experience developing and deploying software in large scale applications
• Strong background in data warehousing technologies (relational, time series, NoSQL) ... PostgreSQL, Greenplum, Apache Spark, Historian, Hadoop
• Expert with SQL, Python, Java, Matlab, R
• Experience with SQL database administration
• Superior analytical skills, with ability to improve / automate existing processes
• Experience in statistics, machine learning, or other data mining techniques
• Agile development training or experience preferred
• Six Sigma training is preferred
• Ability to work both autonomously and as part of a team
• Strong oral and written communication skills
• Ability to understand internal & external customer needs
• Strong interpersonal and leadership skills
• Understanding of Wind turbine control/SCADA data is a plus