목록Artificial Intelligence/Data Analytics (5)
Code&Data Insights
Main Steps in Machine Learning Project 1. Look at the big picture 2. Get the data 3. Discover and visualize the data to gain insights 4. Prepare the data for ML algorithms (data cleaning, preprocessing) 5. Select a model and train it 6. Fine-tune the model 7. Present your solution 8. Launch, monitor, and maintain your system 1. Look at the Big Picture - Frame the Problem(business objective) : wh..
: the process of identifying and connecting records or data entries that correspond to the same real-world entity or individual in one or more data sources. - Improves data quality and integrity - Fosters re-use of existing data sources - Optimize space [ Atomic String Similarity ] Atomic String Similarity, why it is important? - Information Retrieval : similarity of string - Da..
[ Cohort Analysis ] Cohort analysis breaks the data in a data set into related groups before analysis. - a kind of behavior analytics - a group of subjects which share a defining feature and observing the behaviour of the group(cohort) over time and compare it to other cohorts. * Main Stages for Cohort Analysis 1) Determine what question you want to answer to improve business, product, user expe..
[ Type of Data ] [ Sampling Methods ] 1) Random Sampling : sample choice made without any pattern and would be completely unrelated 2) Simple Random Sampling : all of the selections are equally likely, for example drawing one name and each name has the same chance of being selected 3) Systematic Random Sampling : more organized in sample selection, create pattern to choose the samples 4) Stratif..
Pandas : a Python library used for working with data sets. -> Pandas has functions for analyzing, cleaning, exploring, and manipulating data. [ DataFrame ] DataFrame : a Pandas DataFrame is a 2 dimensional data structure, like a 2 dimensional array, or a table with rows and columns in RDB(relational database-SQL) [ Series ] Series : Series is a one-dimensional array holding data of any type, lik..