Unstructured Data Specialist

Job Locations US-NY-New York
Posted Date 6 years ago(8/27/2014 3:30 PM)
# of Openings
Information Technology


SoluStaff is actively recruiting a Data Mining Specialist for a large healthcare organization based in New York City.  The desired candidate must have advanced hands-on expertise in unstructured data mining environments.


  • Explore millions of records – develop, implement, and optimize algorithms to run in real-time / near real-time
  • Develop solutions for analytics and visualization of structured & unstructured data sets
  • Improve scalability performance of existing storage and analytics solutions
  • Ability to mine large sets of structured, semi-structured and unstructured data


  • Experience with data mining (decision tree, logistic regression, cluster analysis, etc.) that includes accessing and analyzing large volumes of data
  • Advanced knowledge of data analysis processes and tools

  • Exposure to a variety of information architecture themes such as MDM, data warehousing, data quality, business intelligence, metadata management, content management, Big Data/NoSQL,etc., and ability to apply concepts in the company context
  • 6-8 years of experience working with large-scale systems and very large data sets
  • Expertise in developing search query commands and data analytics is a must
  • Strong experience in data mining methods and techniques
  • Expertise with mining tools such as Oracle Data Mining (ODM), SQL Server DM, R
  • Strong programming experience in SQL. Proven hands-on coding skills
  • Knowledge of statistics is a plus
  • Strong technical background in programming and experience working hands-on with large-scale data sets

  • Deep knowledge in developing and troubleshooting large scale distributed systems
  • Ability to solve complex problems in a fast paced environment with limited guidance
  • An eye for quality and a willingness to do what is necessary to achieve deadlines in a dynamic environment with frequent priority changes is required.
  • Experience with creating ETL processes to source and link data

  • Scaling, performance and scheduling and ETL techniques

  • Assist in developing internal tools for data analysis

  • Strong proficiency for analyzing source data and creating staging designs and dimensional data models

  • Responsible for Search Semantic/Text Analytics and Optimization program research and deployment. Apply linguistics & technical skills to mine text and extract entities, concepts or sentiment, and discover knowledge from both unstructured and structured data sources.

  • Able to work efficiently in teams and/or as an individual
  • Good oral and written communication skills.
  • Bachelor’s degree or higher in Computer Science, Engineering, Mathematics, or a related discipline


Sorry the Share function is not working properly at this moment. Please refresh the page and try again later.
Share on your newsfeed