Our company provides internship opportunities by partnering with global organizations that offer hands-on experience in data science, data analysis, machine learning, modelling and simulation, and natural language processing.

Data science internships involve techniques in data mining, statistical learning, predictive modeling, mathematical and simulation modeling, forecasting, data visualization, text analytics, social media analytics and natural language processing. Interns in this field are often immersed in the development of systems for natural language analysis, classification tasks, information retrieval, machine translation and the processing of financial data. Interns may be given the opportunity to solve business problems by using data, programming, mathematical and statistical skills, or by using critical thinking to optimize processes, recommend courses of action among different scenarios and drive business success and profitability.

To be successful in this role, an intern must know how to conduct hypothesis testing, regression analysis, forecasting and data mining, and how to process and manipulate data to generate reproducible results. Interns will also need skills in project management and in building predictive models and machine-learning algorithms. During the internship, interns may enhance their skills in developing predictive models and simulations using a variety of software and tools, presenting information using data visualization techniques, identifying valuable data sources and automating collection processes.


Data Analysis

Develop and implement data analyses, data collection systems and other strategies that optimize statistical efficiency and quality. Acquire data from primary or secondary data sources and maintain databases. Assist with assessing tests, implementing new or upgraded software and making strategic decisions on new systems. Prepare and generate reports from single or multiple systems. Provide technical expertise on data storage structures, data mining and data cleansing. Assist in developing static and interactive data visualizations.

Machine Learning

Conduct original research in specific machine learning areas. Solve business problems through machine learning, data mining and statistical algorithms. Assist with data visualization and presentation. Assist with evaluating, revising and improving machine learning systems based on quantitative metrics. Assist with extracting, transforming and cleaning large (multi-TB) data sets in a Unix/Linux environment.

Modelling and Simulations

Assist in the completion of non-routine and advanced tasks. Assist with the analysis, investigation and solution of non-routine problems, and with developing electronic and hard copy documentation as required. Develop scripts to facilitate management and configuration of software used in a complex flight simulator platform for virtual, man-in-the-loop battlespace simulations. Assist with streamlining the installation and integration of component hardware and software in the flight simulator platform. Set up a terrain server to feed 3D terrain graphics to applications. Automate the audio-visual feeds from the virtual flight simulator to external audio-visual destinations. Assist with applying advanced data modeling and forecasting techniques to explore strategic business opportunities.

Natural Language Processing (NLP)

Process and analyze vast amounts of clinical data to help create, from the ground up, a highly scalable infrastructure to house billions of records. Develop innovative methods for processing and storing data. Interrogate analytical results to determine algorithmic success, robustness and validity. Design and implement secure, scalable and fault-tolerant solutions for building production-quality, large-scale deployments of applications related to natural language processing and machine learning. Develop prototype ideas and solutions, then perform critical analysis and creatively solve complex problems. Build document clustering, topic analysis, text classification, named entity recognition, sentiment analysis and part-of-speech tagging methods for structured and semi-structured data.


