Data Scientist

From: Bhagya Raj <bhagyaraj3sbc_at_gmail.com>
Date: Fri, 26 Jul 2019 07:27:37 -0700 (PDT)
Message-ID: <7d6ea8fd-15af-49f3-98bc-93495ad6ee26_at_googlegroups.com>



Role: Data Scientist

Location: Plano, TX

Duration: Long-Term

Primary Job Responsibilities

- Assess the constraints of the data science deployment environment

- Analyze and recommend tools that meet system requirements

- Select the development environment

- Create and configure an Azure data science environment

- Define technical success metrics and quantify risks

- Transform data into usable datasets by developing data structures, designing a data sampling strategy, and designing the data preparation flow

- Perform Exploratory Data Analysis (EDA): review visual analytics data to discover patterns and determine next steps; identify anomalies, outliers, and other data inconsistencies; and create descriptive statistics for a dataset

- Cleanse and transform data: resolve anomalies, outliers, and other data inconsistencies; standardize data formats; and set the granularity of the data

- Apply feature extraction algorithms to numerical and non-numerical data, and scale features

- Perform feature selection: define the optimality criteria and apply feature selection algorithms

- Develop models:

  - Select an algorithmic approach, determine appropriate performance metrics, implement appropriate algorithms, and consider data preparation steps that are specific to the selected algorithms

  - Split datasets: determine the ideal split based on the nature of the data, determine the number of splits, determine the relative size of the splits, and ensure the splits are balanced

  - Identify data imbalances, resample a dataset to impose balance, adjust the performance metric to account for imbalances, and implement penalization

  - Train the model, select early stopping criteria, and tune hyper-parameters

  - Evaluate model performance, score models against evaluation metrics, implement cross-validation, identify and address overfitting, and identify the root cause of performance results
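
As an illustration of the "split datasets" and "ensure splits are balanced" items above, here is a minimal sketch of a stratified train/test split. It is only an assumption about what such a step might look like, not a prescribed method for this role: the function name stratified_split, its parameters, and the example labels are all hypothetical, and it uses only the Python standard library.

```python
import random
from collections import defaultdict


def stratified_split(labels, test_fraction=0.2, seed=0):
    """Hypothetical helper: split row indices so that every class keeps
    roughly the same share in the train and test sets."""
    rng = random.Random(seed)

    # Group row indices by class label.
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)

    train_idx, test_idx = [], []
    for label in sorted(by_class):
        indices = by_class[label]
        rng.shuffle(indices)
        # At least one test row per class; a class with a single row
        # therefore ends up only in the test set.
        n_test = max(1, round(len(indices) * test_fraction))
        test_idx.extend(indices[:n_test])
        train_idx.extend(indices[n_test:])
    return sorted(train_idx), sorted(test_idx)


# Made-up, imbalanced example: 90 "neg" rows and 10 "pos" rows.
labels = ["neg"] * 90 + ["pos"] * 10
train_idx, test_idx = stratified_split(labels, test_fraction=0.2, seed=42)
print(len(train_idx), len(test_idx))
```

Splitting within each class, rather than over the whole dataset at once, keeps the minority class represented in both sets, which matters when the data is imbalanced.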

Basic Qualifications (Minimum):

- Master's (minimum) or PhD (preferred), or other advanced degree in Computer Science or a related field

- 3-4 years (with MS) or 1-2 years (with PhD) of experience manipulating data sets and building statistical models using statistical computer languages (R, Python, SQL, Scala, etc.)

- Knowledge of advanced statistical techniques and concepts (regression, distributions, statistical tests and proper usage, etc.) and experience with their application

- Knowledge of a variety of machine learning techniques (clustering, decision trees, artificial neural networks, etc.) and their real-world advantages and drawbacks

- Knowledge of and experience with deep neural network modeling frameworks (TensorFlow, PyTorch, Caffe, etc.) is preferred

Thanks & Regards

Bhagya Raj | Sr. Recruiter

T 281-823-9222 x580 | bhagyaraj.bakuru_at_3sbc.com

Gtalk: bhagyaraj3sbc_at_gmail.com

Received on Fri Jul 26 2019 - 16:27:37 CEST
