Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Original file line number Diff line number Diff line change
@@ -1,5 +1,9 @@
# AB Testing

A/B testing is a method of comparing two versions of something (like a website, app, or marketing email) to see which one performs better. It works by randomly showing each version to different groups of people and then analyzing which version achieves the desired outcome, such as more clicks, higher conversion rates, or increased engagement. This allows data-driven decisions about which changes to implement.

Visit the following resources to learn more:

- [@article@Practitioner’s Guide to Statistical Tests](https://vkteam.medium.com/practitioners-guide-to-statistical-tests-ed2d580ef04f#1e3b)
- [@article@Step by Step Process for Planning an A/B Test](https://medium.com/data-science/step-by-step-for-planning-an-a-b-test-ef3c93143c0b)
- [@feed@Explore top posts about A/B Testing](https://app.daily.dev/tags/ab-testing?ref=roadmapsh)
- [@feed@Explore top posts about A/B Testing](https://app.daily.dev/tags/ab-testing?ref=roadmapsh)
Original file line number Diff line number Diff line change
@@ -1,7 +1,10 @@
# Classic/Advanced ML
# Machine Learning Algorithms

Machine learning algorithms are the core tools used to enable computers to learn from data without being explicitly programmed. These algorithms identify patterns, make predictions, and improve their performance over time through experience. They are broadly categorized into classic methods like supervised (where the algorithm learns from labeled data) and unsupervised learning (where the algorithm discovers patterns in unlabeled data), and more advanced techniques like ensemble methods (combining multiple models for better accuracy) and neural networks (complex algorithms inspired by the structure of the human brain).

Visit the following resources to learn more:

- [@opensource@Repository of notes, code and notebooks in Python for the book Pattern Recognition and Machine Learning by Christopher Bishop](https://github.com/gerdm/prml)
- [@article@Open Machine Learning Course](https://mlcourse.ai/book/topic01/topic01_intro.html)
- [@article@Coursera: Machine Learning Specialization](https://imp.i384100.net/oqGkrg)
- [@article@Pattern Recognition and Machine Learning by Christopher Bishop](https://www.microsoft.com/en-us/research/uploads/prod/2006/01/Bishop-Pattern-Recognition-and-Machine-Learning-2006.pdf)
- [@opensource@Repository of notes, code and notebooks in Python for the book Pattern Recognition and Machine Learning by Christopher Bishop](https://github.com/gerdm/prml)
- [@feed@Explore top posts about Machine Learning](https://app.daily.dev/tags/machine-learning?ref=roadmapsh)
- [@article@Pattern Recognition and Machine Learning by Christopher Bishop](https://www.microsoft.com/en-us/research/uploads/prod/2006/01/Bishop-Pattern-Recognition-and-Machine-Learning-2006.pdf)
Original file line number Diff line number Diff line change
@@ -1,5 +1,10 @@
# Data Structures and Algorithms
# Data Structures and Algorithms (Python)

Data structures are ways of organizing and storing data in a computer so that it can be used efficiently. Algorithms are step-by-step procedures for solving a problem, often involving manipulating data within these structures. Using Python, common data structures include lists, dictionaries, trees, and graphs, while algorithms encompass searching, sorting, and optimization techniques. These concepts provide the foundational building blocks for creating efficient and scalable solutions.

Visit the following resources to learn more:

- [@roadmap@Visit the Dedicated DSA Roadmap](https://roadmap.sh/datastructures-and-algorithms)
- [@article@Learn Algorithms](https://leetcode.com/explore/learn/)
- [@article@Leetcode - Study Plans](https://leetcode.com/studyplan/)
- [@article@Algorithms Specialization](https://imp.i384100.net/5gqv4n)
Original file line number Diff line number Diff line change
@@ -1,6 +1,9 @@
# Data Understanding, Analysis and Visualization
# Data Understanding, Data Analysis, and Visualization

Data understanding involves gathering initial insights about a dataset, including its source, content, and potential issues. Data analysis then uses various techniques to extract meaningful information, identify patterns, and test hypotheses within the data. Finally, data visualization presents these findings in a graphical format, making complex data more accessible and understandable for decision-making.

Visit the following resources to learn more:

- [@article@Exploratory Data Analysis With Python and Pandas](https://imp.i384100.net/AWAv4R)
- [@article@Exploratory Data Analysis for Machine Learning](https://imp.i384100.net/GmQMLE)
- [@article@Python for Data Visualization: Matplotlib & Seaborn](https://imp.i384100.net/55xvzn)

- [@article@Python for Data Visualization: Matplotlib & Seaborn](https://imp.i384100.net/55xvzn)
Original file line number Diff line number Diff line change
@@ -1,3 +1,7 @@
## Deep Learning
# Deep Learning

Deep learning is a subset of machine learning that deals with algorithms inspired by the structure and function of the brain called artificial neural networks. Deep learning is a key technology behind driverless cars, enabling them to recognize a stop sign, or to distinguish a pedestrian from a lamppost. It is the key to voice control in consumer devices like phones, tablets, TVs, and hands-free speakers. Deep learning is getting lots of attention lately and for good reason. It’s achieving results that were not possible before.
Deep Learning is a subset of machine learning that uses artificial neural networks with multiple layers (hence "deep") to analyze data and learn complex patterns. These networks are inspired by the structure and function of the human brain, and they are particularly effective at tasks like image recognition, natural language processing, and speech recognition. By processing data through these layers, deep learning models can automatically extract features and make predictions with high accuracy.

Visit the following resources to learn more:

- [@roadmap@Visit the Dedicated Machine Learning Roadmap](https://roadmap.sh/machine-learning)
Original file line number Diff line number Diff line change
@@ -1,5 +1,9 @@
# MLOps
# Deployment Models and CI/CD

Deployment models define how a trained machine learning model is put into production, making it accessible for real-world use. Continuous Integration and Continuous Delivery (CI/CD) are practices used to automate the process of building, testing, and deploying these models, enabling faster and more reliable updates to your AI systems.

Visit the following resources to learn more:

- [@article@Machine Learning Engineering for Production (MLOps) Specialization](https://imp.i384100.net/nLA5mx)
- [@article@Full Stack Deep Learning](https://fullstackdeeplearning.com/course/2022/)
- [@feed@Explore top posts about CI/CD](https://app.daily.dev/tags/cicd?ref=roadmapsh)
- [@feed@Explore top posts about CI/CD](https://app.daily.dev/tags/cicd?ref=roadmapsh)
Original file line number Diff line number Diff line change
@@ -1,4 +1,8 @@
# Differential Calculus

Differential calculus is a branch of mathematics concerned with the study of rates at which quantities change. It involves finding derivatives, which represent the instantaneous rate of change of a function with respect to its input variable. Key concepts include limits, continuity, differentiation rules, and applications such as optimization and related rates.

Visit the following resources to learn more:

- [@article@Algebra and Differential Calculus for Data Science](https://imp.i384100.net/LX5M7M)
- [@video@Calculus Youtube Course](https://www.youtube.com/playlist?list=PLZHQObOWTQDMsr9K-rj53DwVRMYO3t5Yr)
- [@video@Calculus Youtube Course](https://www.youtube.com/playlist?list=PLZHQObOWTQDMsr9K-rj53DwVRMYO3t5Yr)
Original file line number Diff line number Diff line change
@@ -1,3 +1,3 @@
# Econometrics

Econometrics is the application of statistical methods to economic data. It is a branch of economics that aims to give empirical content to economic relations. More precisely, it is "the quantitative analysis of actual economic phenomena based on the concurrent development of theory and observation, related by appropriate methods of inference." Econometrics can be described as something that allows economists "to sift through mountains of data to extract simple relationships."
Econometrics is the application of statistical methods to economic data. It is a branch of economics that aims to give empirical content to economic relations. More precisely, it is "the quantitative analysis of actual economic phenomena based on the concurrent development of theory and observation, related by appropriate methods of inference." Econometrics can be described as something that allows economists "to sift through mountains of data to extract simple relationships."
Original file line number Diff line number Diff line change
@@ -1,7 +1,10 @@
# Fully Connected NN, CNN, RNN, LSTM, Transformers, Transfer Learning
# Neural Network Architectures

These are different types of neural networks, each designed with a specific structure to excel at particular tasks. Fully Connected Networks (also known as Dense Networks) have every neuron connected to every neuron in the adjacent layers. Convolutional Neural Networks (CNNs) are particularly effective for image recognition by using convolutional layers to extract features. Recurrent Neural Networks (RNNs) are designed for sequential data, like text or time series, by maintaining a hidden state that captures information about past inputs. Long Short-Term Memory networks (LSTMs) are a special type of RNN that mitigate the vanishing gradient problem and are better at capturing long-range dependencies in sequential data. Transformers rely on the attention mechanism to weigh the importance of different parts of the input sequence, proving highly effective in natural language processing. Finally, Transfer Learning (TL) involves leveraging knowledge gained from solving one problem to a different but related problem, improving efficiency and performance.

Visit the following resources to learn more:

- [@article@The Illustrated Transformer](https://jalammar.github.io/illustrated-transformer/)
- [@article@Attention is All you Need](https://arxiv.org/pdf/1706.03762.pdf)
- [@article@Deep Learning Book](https://www.deeplearningbook.org/)
- [@article@Deep Learning Specialization](https://imp.i384100.net/Wq9MV3)

- [@article@Deep Learning Specialization](https://imp.i384100.net/Wq9MV3)
Original file line number Diff line number Diff line change
@@ -1,4 +1,8 @@
# Hypothesis Testing

Hypothesis testing is a statistical method used to determine whether there is enough evidence in a sample of data to infer that a certain condition is true for an entire population. It involves formulating a null hypothesis (a statement of no effect or no difference) and an alternative hypothesis (the statement you are trying to find evidence for), then using sample data to assess the likelihood of observing the data if the null hypothesis were true. Based on this likelihood, a decision is made to either reject the null hypothesis in favor of the alternative, or fail to reject the null hypothesis.

Visit the following resources to learn more:

- [@article@Introduction to Statistical Analysis: Hypothesis Testing](https://imp.i384100.net/vN0JAA)
- [@feed@Explore top posts about Testing](https://app.daily.dev/tags/testing?ref=roadmapsh)
- [@feed@Explore top posts about Testing](https://app.daily.dev/tags/testing?ref=roadmapsh)
Original file line number Diff line number Diff line change
@@ -1,9 +1,10 @@
# Increasing Test Sensitivity

Test sensitivity refers to the ability of a test to correctly identify individuals who truly have the condition or characteristic being tested for. It essentially measures the proportion of true positives that are correctly identified by the test. Increasing test sensitivity means improving the test's ability to detect more true positives, reducing the number of false negatives (cases where the test incorrectly says someone doesn't have the condition when they actually do).

Visit the following resources to learn more:

- [@article@Minimum Detectable Effect (MDE)](https://splitmetrics.com/resources/minimum-detectable-effect-mde/)
- [@article@Improving the Sensitivity of Online Controlled Experiments: Case Studies at Netflix](https://kdd.org/kdd2016/papers/files/adp0945-xieA.pdf)
- [@article@Improving the Sensitivity of Online Controlled Experiments by Utilizing Pre-Experiment Data](https://exp-platform.com/Documents/2013-02-CUPED-ImprovingSensitivityOfControlledExperiments.pdf)
- [@article@How Booking.com increases the power of online experiments with CUPED](https://booking.ai/how-booking-com-increases-the-power-of-online-experiments-with-cuped-995d186fff1d)
- [@article@Improving Experimental Power through Control Using Predictions as Covariate — CUPAC](https://doordash.engineering/2020/06/08/improving-experimental-power-through-control-using-predictions-as-covariate-cupac/)
- [@article@Improving the Sensitivity of Online Controlled Experiments: Case Studies at Netflix](https://www.researchgate.net/publication/305997925_Improving_the_Sensitivity_of_Online_Controlled_Experiments_Case_Studies_at_Netflix)

- [@article@Improving the Sensitivity of Online Controlled Experiments: Case Studies at Netflix](https://www.researchgate.net/publication/305997925_Improving_the_Sensitivity_of_Online_Controlled_Experiments_Case_Studies_at_Netflix)
Original file line number Diff line number Diff line change
@@ -1,5 +1,10 @@
# Python
# Learn Python Programming Language

Python is a versatile, high-level programming language known for its readability and extensive libraries. It supports multiple programming paradigms, including object-oriented, imperative, and functional styles. Python's syntax emphasizes code clarity, making it easier to learn and use, and its vast ecosystem of packages caters to a wide range of applications.

Visit the following resources to learn more:

- [@roadmap@Visit the Dedicated Python Roadmap](https://roadmap.sh/python)
- [@article@Kaggle — Python](https://www.kaggle.com/learn/python)
- [@article@Google's Python Class](https://developers.google.com/edu/python)
- [@feed@Explore top posts about Python](https://app.daily.dev/tags/python?ref=roadmapsh)
- [@feed@Explore top posts about Python](https://app.daily.dev/tags/python?ref=roadmapsh)
Original file line number Diff line number Diff line change
@@ -1,4 +1,10 @@
# SQL
# Learn SQL

SQL, or Structured Query Language, is the standard language for interacting with databases. It allows you to retrieve, update, and manage data stored in relational database management systems (RDBMS) like MySQL, PostgreSQL, and SQL Server. You use SQL to write queries to filter, sort, and aggregate data, as well as to define and manipulate the structure of the database itself.

Visit the following resources to learn more:

- [@roadmap@Visit Visit the Dedicated SQL Roadmapthe](https://roadmap.sh/sql)
- [@course@Master SQL with Roadmap](https://roadmap.sh/courses/sql)
- [@article@SQL Tutorial](https://www.sqltutorial.org/)
- [@feed@Explore top posts about SQL](https://app.daily.dev/tags/sql?ref=roadmapsh)
- [@feed@Explore top posts about SQL](https://app.daily.dev/tags/sql?ref=roadmapsh)
Original file line number Diff line number Diff line change
@@ -1,5 +1,9 @@
# Learn Algebra, Calculus, Mathematical Analysis
# Linear Algebra, Calculus, and Mathematical Analysis

Linear algebra deals with vectors, matrices, and linear transformations, providing the foundation for representing and manipulating data in higher dimensions. Calculus focuses on continuous change, covering concepts like derivatives and integrals, which are essential for optimization and modeling. Mathematical analysis rigorously studies the concepts underlying calculus, such as limits, continuity, and sequences, providing a deeper understanding of the theoretical underpinnings of many algorithms.

Visit the following resources to learn more:

- [@article@Mathematics for Machine Learning Specialization](https://imp.i384100.net/baqMYv)
- [@video@Linear Algebra Youtube Course](https://www.youtube.com/playlist?list=PLZHQObOWTQDPD3MizzM2xVFitgF8hE_ab)
- [@feed@Explore top posts about Math](https://app.daily.dev/tags/math?ref=roadmapsh)
- [@feed@Explore top posts about Math](https://app.daily.dev/tags/math?ref=roadmapsh)
Original file line number Diff line number Diff line change
@@ -1,3 +1,7 @@
# Machine Learning

Machine learning is a field of artificial intelligence that uses statistical techniques to give computer systems the ability to "learn" (e.g., progressively improve performance on a specific task) from data, without being explicitly programmed. The name machine learning was coined in 1959 by Arthur Samuel. Evolved from the study of pattern recognition and computational learning theory in artificial intelligence, machine learning explores the study and construction of algorithms that can learn from and make predictions on data – such algorithms overcome following strictly static program instructions by making data-driven predictions or decisions, through building a model from sample inputs. Machine learning is employed in a range of computing tasks where designing and programming explicit algorithms with good performance is difficult or infeasible; example applications include email filtering, detection of network intruders, and computer vision.
Machine learning involves creating algorithms that allow computer systems to learn from data without being explicitly programmed. These algorithms identify patterns, make predictions, and improve their performance over time through experience. The learning process can be supervised, unsupervised, or reinforcement-based, depending on the type of data and the desired outcome.

Visit the following resources to learn more:

- [@roadmap@Visit the Dedicated Machine Learning Roadmap](https://roadmap.sh/machine-learning)
Original file line number Diff line number Diff line change
@@ -1,3 +1,7 @@
# Mathematics

Mathematics is the foundation of AI and Data Science. It is essential to have a good understanding of mathematics to excel in these fields.
Mathematics provides the foundational language and tools for understanding and building algorithms. It encompasses concepts like linear algebra (for manipulating data), calculus (for optimization), probability and statistics (for understanding uncertainty and drawing inferences), and discrete mathematics (for reasoning about data structures and algorithms). These mathematical principles enable the development, analysis, and interpretation of models used in artificial intelligence and data science.

Visit the following resources to learn more:

- [@book@Linear Algebra Done Right - Sheldon Axler](https://linear.axler.net/LADR4e.pdf)
Original file line number Diff line number Diff line change
@@ -1,3 +1,7 @@
# MLOps

MLOps is a practice for collaboration and communication between data scientists and operations professionals to help manage production ML lifecycle. It is a set of best practices that aims to automate the ML lifecycle, including training, deployment, and monitoring. MLOps helps organizations to scale ML models and deliver business value faster.
MLOps is a practice for collaboration and communication between data scientists and operations professionals to help manage the production ML lifecycle. It is a set of best practices that aims to automate the ML lifecycle, including training, deployment, and monitoring. MLOps helps organizations to scale ML models and deliver business value faster.

Visit the following resources to learn more:

- [@roadmap@Visit the Dedicated MLOps Roadmap](https://roadmap.sh/mlops)
Original file line number Diff line number Diff line change
@@ -1,4 +1,7 @@
# Econometrics Pre-requisites
# Pre-requisites of Econometrics

- [@article@10 Fundamental Theorems for Econometrics](https://bookdown.org/ts_robinson1994/10EconometricTheorems/)
Econometrics uses statistical methods to analyze economic data and test economic theories. Before diving into econometrics, a solid foundation in several areas is needed. This includes a grasp of basic mathematics like calculus and linear algebra, a strong understanding of probability and statistics, and familiarity with economic principles. These pre-requisites provide the necessary tools for understanding econometric models and interpreting their results.

Visit the following resources to learn more:

- [@article@10 Fundamental Theorems for Econometrics](https://bookdown.org/ts_robinson1994/10EconometricTheorems/)
Loading