DATA SCIENCE & ENGINEERING. ART

The data science team combines coding and math’s.

As a result you will take data driven decisions which will impact all parts of your business and deliver real growth.

WHAT WE DO

  • Data Science

    Our data science team brings their programming knowledge and mathematical skills coupled with domain experience to solve business problems.

    They delve in artificial intelligence, machine learning and other supervised or unsupervised models depending on the business problem.

  • Data Engineering

    Our engineering team provides the data framework that enables our scientists and analysts to solve business problems.

    We have made significant progress in this are over the past 12 months, investing in the analytics server and the new MarTech environment.

Check out some of our Projects below:

MLOPS

Machine Learning Operation is the combination of 3 items: DevOps + DataOps + Model Ops. It combines proper governance e.g. version control, logs and lineage of the main components of a Machine Learning powered application: Code, Data and ML models.

By using Databricks platform over Azure we were able to migrate all our ML powered applications to a fully governed and scalable framework using Pyspark and Mlflow.

MLOps compliant pipelines are easy to monitor and maintain, leaving the team able to develop more models to support revenue growth and cost reduction objectives.

Sentiment layout

Data is available either via direct connection to Databricks (internal use) or via public REST API and that allowed us to visualise this datapoint in combination with other useful information such as pricing.

Simply overlaying these 2 data points can give actionable data to our customers and internal analysts.

Data is currently used on email communications and internally via PowerBI reports.

Sentiment Backend in Databricks

Using Databricks were able to combine historical data (from the EDS tables) and the streaming data from the execution venue to calculate near real time sentiment on our markets for all Service Offering. Currently the data includes G2 and MT4. In Q3 FY2024 MT5 will be added into the live stream.

The data points are refreshed every 10 mins and combine trading and customer data so that sentiment can be segmented by Value Tier or regulatory classification.

ACQUISITION DASHBOARD

Provide end to end data from impression to trading so you can understand the ROMI of each channel.

We applied "last paid channel" attribution and the data model can give a comprehensive view of both organic and paid channels.

Per advertisement cost is also included in the Dashboard with a daily granularity.

Data is refreshed every day thanks to our scripts running on Airflow.

GCLID PROJECT

Thanks to our Airflow instance we are able to periodically (every 3 hours, planning to decrease to 1 hour) communicate to the Google Advertisement Platform (Google Campaign Manager) which customers went on to trade or fund.

This deep conversion data point is then used by Google to further optimize our advertisement.

PERSONAS DASHBOARD

Using unsupervised k-means clustering we identified 4 core groups of our customer base.

This allowed us to understand other characteristics such as demographics features. The data-driven study of Personas allowed better targeting in acquisition and also more customized journeys in CRO and CVM.

Persona description is available in the following systems: Google Analytics, Monetate, DWH and Salesforce

  • Segmentation

    Segments are used to divide the target market into manageable groups to offer a tailored experience of our products and services

  • Marketing ROI

    We evaluate the marketing touchpoints a consumer encounters on their path to conversion and link spend to revenue

  • Lifetime Value

    Projecting lifetime value of a client is important for acquisition, retention sales and value management teams.

  • Propensity Models

    Be it lead scoring for new acquisition or churn models, propensity models are key to provide a targeted service to a Retail client base

Meet the Team

  • Elettra Damaggio

    DIRECTOR OF DATA SCIENCE & ENGINEERING

    Doing anything that Anirban doesn’t want to do.

  • Sumeyra Karaca

    DATA SCIENTIST

    Working on NLP models. Projects: Text to Analytics.

  • Agata Plewa

    DATA SCIENTIST

    Working on Customer Behavior Forecasting. Projects: Life time value, Personas, Churn Prediction

  • Chitra Pun

    DATA ENGINEER

    Supporting the team into migrating in the new Martech Platform

  • Hela Momand

    DATA ENGINEER

    Supporting the team into migrating in the new Martech Platform

  • Cigdem Tuncer

    DATA ENGINEER

    Supporting the team into migrating in the new Martech Platform

“The analytics team help us identify various commercial opportunities and we are excited to see what they have in store for us in 2022!”

— Sixto Alonso, VP Americas, Retail

Core competencies

Machine Learning

Using supervised and unsupervised machine learning techniques to solve business problems. Examples of what we used so far: K-means clustering, Neural Networks and XGBOOST random forests.

Data Integration / Engineering

Building ETL pipelines to collect, transform and process data from our various data sources in cloud and on premises, supporting commercial reporting and ML modules. The Data Engineering team also builds two-way connectors to make data points available where is most useful for the business.

Languages / Platforms

We code in: Python, SQL, Big Query SQL

We use: PowerBI, AirFlow, DAX, Google Cloud, sklearn and scikit learn, REST API.

Automation

Data automation for end to end reporting via Airflow (task scheduler).

We can develop automated process for Data treatment and consumption.

Let’s connect to discuss how analytics can improve performance


Elettra Damaggio
Director, Data Science
elettra.damaggio@stonex.com