Utilize statistical natural language processing to mine unstructured data, and create insights; analyze and model structured data using advanced statistical methods and implement algorithms and software needed to perform analyses
Build document clustering, topic analysis, text classification, named entity recognition, sentiment analysis, and part-of-speech tagging methods for unstructured and semi-structured data
Cluster and analyze large amounts of user-generated content and process data in large-scale environments using Amazon EC2, Storm, Hadoop, and Spark
Develop and perform text classification using methods such as logistic regression, decision trees, support vector machines, and maximum entropy classifiers
Develop methods to support and drive client engagements focused on Big Data and Advanced Business Analytics, in diverse domains such as product development, marketing research, public policy, optimization, and risk management; communicate results and educate others through reports and presentations
Perform text mining, generate and test working hypotheses, prepare and analyze historical data, and identify patterns
Requirements & Skills:
Four years of professional experience working in Natural Language Processing or related field
Experience with command-line scripting, data structures, and algorithms and ability to work in a Linux environment, processing large amounts of data in a cloud environment
Strong data extraction and processing, using MapReduce, Pig, and/or Hive preferred
Applicants must be currently authorized to work in the United States without the need for visa sponsorship now or in the future