Media Cloud Data Architect

Massachusetts, United States
02 Aug 2019
End of advertisement period
01 Oct 2019
Contract Type
Full Time

The University of Hong Kong 's highest priorities are to create opportunities for the very best academic talents to excel and to advance human knowledge to the benefit of society. We serve the needs of Hong Kong, the wider region and the rest of the world.

Working at MIT offers opportunities, an environment, a culture  and benefits that just aren’t found together anywhere else. If you’re curious, motivated, want to be part of a unique community, and help shape the future then take a look at this opportunity.

MEDIA CLOUD DATA ARCHITECT, Media Lab Center for Civic Media, to join Media Cloud, a project building and maintaining data-centric media analysis tools that investigate and track how hate speech moves across the internet. The archive includes over one-billion stories with more than 700,000 added daily. Will participate in the full life cycle of application development, enabling collection and processing of these stories and making them available to researchers via an API.  Responsibilities include evaluating, designing, building, and maintaining back-end server architecture and data pipeline including creating technical specifications and test scenarios; establishing and communicating a technical vision for data architecture; collaborating on developing and implementing a technical roadmap; writing new code to scale systems to handle rapidly growing data requirements, architecting code for scalability; building, maintaining, and upgrading systems within existing codebase; communicating project status; and other duties as assigned. 

Job Requirements

REQUIRED: at least five years’ experience working as a software engineer/architect on big data-related projects and working with text-based data system (i.e., NLP), PostgreSQL databases, or Solr databases; programming fluency in Python; experience writing, maintaining, and optimizing SQL queries against large databases and scaling platforms to handle large data sets; experience implementing and maintaining a production ETL pipeline; history of crafting/building/testing/deploying robust code; ability to iterate quickly through prototypes, using data to validate architectural decisions; interest in working on issues related to hate-speech/democracy/gender/race/health; and experience working on diverse teams and with different disciplines.  Bachelor’s degree, Perl programming fluency, experience writing web crawlers or API scrapers, ability to scale platforms to handle more users, desire to solve difficult engineering and data problems, and knowledge of/interest in social sciences preferred.  Job #17813-9

This is a one-year position with possibility of extension based on funding and research priorities.  

MIT is an equal employment opportunity employer. All qualified applicants will receive consideration for employment and will not be discriminated against on the basis of race, color, sex, sexual orientation, gender identity, religion, disability, age, genetic information, veteran status, ancestry, or national or ethnic origin.