Go to Quantitative Methods

Topics

Table of Contents



Introduction




How Is Fintech Used in Quantitative Investment Analysis?


Note

The term Fintech comes from combining Finance and Technology.

Fintech refers to technological innovation in the design and delivery of financial products and services.

Its earlier forms involved data processing and automation of routine tasks. It later advanced into decision-making applications based on complex machine learning logic.

The major drivers of fintech have been:

We focus on quantitative analysis in the investment industry:

Big Data


Big Data refers to vast amount of data generated by industry, governments, individuals, and electronic devices.

Characteristics of big data

Pasted image 20250909190514.png

Data Volume Key Bytes of Info
MB MegaByte 1 M
GB GigaByte 1 B
TB TeraByte 1 T
PB PetaByte 1 Q
In addition to these three V’s, a fourth V is becoming increasingly important, especially when using big data for drawing inferences or making predictions.
  1. Veracity – Credibility and reliability of different data sources.

Big Data can be structured (can be organized in tables), semi-structured, or unstructured (cannot be represented in a tabular form).

Sources of Big Data

Individuals Biz Process Sensors
Social Media, Reviews Transaction Data Satellites
News Corporate Data Geolocation
Web Searches IoT
Personal Data Other Sensors
Big Data Challenges


Advanced Analytical Tools: Artificial Intelligence and Machine Learning


Artificial Intelligence

Example

Chess playing computer programs, Digital assistants like Apple’s Siri, etc.

Machine Learning

Computer-based techniques that extract knowledge from large amounts of data by learning from known examples and then generating structure or predictions without relying on any help from a human.

In ML, the dataset is divided into three distinct subsets:

  1. Training dataset: Allows the algorithm to identify relationships between inputs and outputs based on historical patterns in the data.
  2. Validation dataset: Used to validate and model tune the relationships.
  3. Test dataset: Used to test the model’s ability to predict well on new data.

Broadly speaking there are three main approaches to machine learning:

  1. Supervised learning: Both inputs and outputs are labeled. After learning from labeled data, the trained algorithm is used to predict outcomes for new data sets.
  2. Unsupervised learning: Input and output variables are not labeled. The ML algorithm has to seek relationships on its own.
  3. Deep learning: Neural networks are used by the computers to perform multistage, non-linear data processing to identify patterns.
Tip

For ML to work well, good human judgment is required.

Human judgment is required for questions like:

  • Which data to use
  • How much data to use
  • Which analytical techniques are relevant
  • Clean and filter the data

Deep learning algorithms are used for image, pattern, and speech recognition.

Some challenges associated with machine learning are:



Tackling Big Data with Data Science


Data science leverages advances in computer science, statistics, and other disciplines for the purpose of extracting information from Big Data.

Data Processing Methods


Data Visualization


How the data will ultimately be presented to the analyst/user.

Data Visualization
Graphs, Charts → Heat Maps, Tree Diagrams, and Tag Clouds

Example

  • Heat map of a city where routes with high traffic congestion are shown in red.
  • A tag cloud works on textual data where words that appear more often are shown in larger font.
    Pasted image 20250909201501.png

Text Analytics and Natural Language Processing


Text Analytics
Use of computer programs to derive meaning from large, unstructured text or voice-based data. Based on this, we can determine if the sentiment is very positive, positive, neutral, or negative.

Natural Language Processing (NLP)
Application of text analytics whereby computers analyze and interpret human language. Such processing is possible because of access to Big Data and processing power.

Example

  • Gauge the consumer sentiment about a new product by analyzing what is being said about the product.
  • Used to gain insights in communications from policy makers who use a more formal tone.