Sr. Software Development Engineer - Machine Learning / Information Retrieval Job in Seattle 98119, Washington US
Sr. Software Dev Engineer, Information Retrieval / ML / Data Mining
Amazon.com's Product Ads team is looking for exceptional software engineers to develop algorithms and build systems to solve a variety of information retrieval, machine learning and data mining problems related to the Amazon Product Catalog using large scale distributed systems.
What we do:
Our team develops and employs innovative, cutting-edge techniques in data mining, information retrieval and distributed computing to mine the Amazon product catalog and identify relationships between the various offers provided by advertisers. We develop systems that build intelligence about Amazon's selection, the Earth's largest, while maintaining a relentless focus on the shopping experience for which we are world-renowned.
Our systems and algorithms operate on one of the world's largest product catalogs. It is routine for them to process terabyte-scale datasets using distributed frameworks such as Apache Hadoop (MapReduce) and other open-source technologies such as Lucene. We consistently strive to improve the customer search and browse experience.
Our team is dedicated to finding similar products in order to place contextually relevant product advertisements on each page, as well as detecting egregious cases of poor-quality data provided by sellers. We also build algorithms and systems to discover useful relationships between the billions of items in our catalog. Examples of such relationships include items that are variations of each other and items that are accessories of other items, to name a few. Inferring these relationships between products allows us to deliver an effective customer search and browse experience.
Who we are looking for:
We are looking for highly motivated, innovative Software Engineers who thrive on challenging problems. As part of this team you will develop new algorithms and build and maintain large-scale distributed systems in Java/C++ using open-source technologies such as Apache Lucene and Hadoop (MapReduce), along with Amazon.com proprietary technologies. You will need to analyze data to find patterns that may suggest new product features and inform your algorithm design.
About Product Ads:
Product Ads is a high-profile, strategic business unit, with support and interest from all parts of Amazon and top management. We are a highly motivated, collaborative and fun-loving team building a high-growth business. We are entrepreneurial and have a bias for action, with a broad mandate to experiment and innovate. This is an opportunity to make a significant impact on the future of the Amazon vision.
A successful candidate will have the satisfaction of seeing their work on the Amazon.com website and numerous other websites, where it will be viewed by tens of millions of customers and drive quantifiable revenue impact. They will also broaden their technical skills and work in an environment that thrives on creativity, experimentation, and product innovation.
- Bachelor's degree in Computer Science with at least 3 years of industry experience.
- Strong, proven ability to build high-performance, highly available and scalable distributed systems.
- Past experience or strong interest in at least one of the following areas: Information Retrieval, Data Mining, Machine Learning or Natural Language Processing.
- Extensive experience programming in Java/C++ on Unix platforms.
- Experience with object-oriented design and development.
- Proficiency with at least one of these scripting languages: Perl / Python / Ruby / shell script.
- Master's degree in Computer Science with 5+ years of relevant industry experience, or a PhD.
- Familiarity with the pay-for-performance ad model and the Internet advertising industry.
- Familiarity with open-source distributed computing frameworks such as Hadoop and HBase (MapReduce, Bigtable) is a plus.
- Experience in the design and development of large systems. Ability to handle terabytes of data without flinching is a plus.
- Extensive experience in one or more of the following areas: Information Retrieval, Data Mining, Machine Learning or Natural Language Processing.
- SQL and data modeling skills.
- Highly innovative, flexible and self-directed.
- Excellent written and verbal communication skills.