'Data Science' 카테고리의 글 목록 (4 Page)

Notice

Recent Posts

Recent Comments

Link

« 2024/09 »
일	월	화	수	목	금	토
1	2	3	4	5	6	7
8	9	10	11	12	13	14
15	16	17	18	19	20	21
22	23	24	25	26	27	28
29	30

Tags more

Archives

Today

Total

관리 메뉴

목록Data Science (47)

Code&Data Insights

[Machine Learning] Association Rule Learning - Apriori | Eclat Algorithm

[ What is Association Rule Learning? ] Association Rule Learning : Association Rule Learning is a data mining technique that discovers rules indicating the co-occurrence of two or more items. => Identifiy the relationships between items and discovers valuable rules indicating their co-occurrence. => For example, People who bought 'this stuff', they also bought 'this stuff'. | "You may also like”..

Data Science/Machine Learning 2023. 6. 19. 09:31

[Machine Learning] Clustering - Hierarchical Clustering | Agglomerative Hierarchical Clustering | Dendrograms

[ Hierarchical Clustering ] Hierarchical Clustering : Hierarchical clustering is a data analysis technique that groups data hierarchically based on similarity or distance - Use Euclidean distance or Manhattan distance - 2 approachs for hierarchical Clustering : 1) Agglomaerative- Top-down 2) Divisive - Bottom-up [ Agglomerative Hierarchical Clustering ] ( Agglomerative Hierarchical Clustering : ..

Data Science/Machine Learning 2023. 6. 19. 05:54

[Machine Learning] Classification - K-Nearest Neighbours(KNN) | Naive Bayes

[ K-Nearest Neighbours ] K Nearest Neighbors (KNN) => KNN is a supervised learning classifier, which uses proximity to make classifications or predictions about the grouping of an individual data point. How It Works? Step 1) Choose the number K of neighbors Step 2) Take the K nearst neighbors of the new data point, according to the Euclidean distance - Euclidean Distance : √((x₂ - x₁)² + (y₂ - y..

Data Science/Machine Learning 2023. 6. 16. 09:00

[Machine Learning] Ensemble Learning | Random Forest

[ Ensemble Learning ] Ensemble Learning : when we take multiple algorithms or the same algorithm multiple times and we put them together that results in a much more powerful version. => It helps to improve the performance and accuracy of machine learning algorithms. => Putting multiple ML algorithms together to create one bigger ML algorithm that leverages many other ML algorithms. Type of Ensem..

Data Science/Machine Learning 2023. 6. 15. 07:43

[Machine Learning] Linear Model - Linear Regression | Logistic Regression | Multiclass Logistic Regression | Linear Basis Function Models

Linear Model - Easy to optimize, fast training and prediction - Good Interpretability - ONLY suitable for linearly separable classes => The capacity of the linear model depends on the input dimensionality D. => VC dimensions : D + 1 for Logistic regression VC dimension? : a measure of the capacity or complexity of a hypothesis space Linear Regression - Parameter space is convex - Objective funct..

Data Science/Machine Learning 2023. 6. 14. 07:53

[Multiple Linear Regression] 5 methods of building models | Stepwise Regression

In Multiple Iinear Regression Model, there are many variables. To build a model, we need to choose right variables ! ( Using all the variables given in the data, it's NOT a good idea ) [ 5 methods of building models ] 1. All-in - Prior knowledge - Preparing for Backward Elimination 2. Backward Elimination Step 1) Select a significance level to stay in the model (ex) SL = 0.05 ----> SL = Signific..

Data Science/Machine Learning 2023. 6. 13. 09:10

[Mathematics of Data Management ] study notes | basic concepts related to data analytics

[ Type of Data ] [ Sampling Methods ] 1) Random Sampling : sample choice made without any pattern and would be completely unrelated 2) Simple Random Sampling : all of the selections are equally likely, for example drawing one name and each name has the same chance of being selected 3) Systematic Random Sampling : more organized in sample selection, create pattern to choose the samples 4) Stratif..

Data Science/Data Analytics 2023. 6. 13. 03:14

[Pandas] Pandas DataFrame | Series | Index | Basic APIs

Pandas : a Python library used for working with data sets. -> Pandas has functions for analyzing, cleaning, exploring, and manipulating data. [ DataFrame ] DataFrame : a Pandas DataFrame is a 2 dimensional data structure, like a 2 dimensional array, or a table with rows and columns in RDB(relational database-SQL) [ Series ] Series : Series is a one-dimensional array holding data of any type, lik..

Data Science/Data Analytics 2023. 6. 2. 08:25

Prev 1 2 3 4 5 6 Next

목록Data Science (47)

Code&Data Insights

티스토리툴바