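Recently someone asked whether there are relevant datasets, so over the past few days I took the time to compile the ones below. Each title is the name of a Kaggle competition, so you can search for it directly to find the full competition description. Listed here are the 10 competitions with the most participating teams, along with their tags; most importantly, the datasets are available for download.
Kaggle is a good platform for deepening your understanding of ML. No matter how much you study, nothing beats starting hands-on practice now, and no matter how many algorithms your résumé lists, none of it is as convincing as a single top-3 competition finish.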
1 Titanic: Machine Learning from Disaster
Start here!
Predict survival on the Titanic and get familiar with ML basics
2 House Prices: Advanced Regression Techniques
Predict sales prices
Practice feature engineering, random forests (RFs), and gradient boosting
3 Digit Recognizer
CV starts here!
Learn computer vision fundamentals with the famous MNIST data
4 TalkingData AdTracking Fraud Detection Challenge
Fraudulent click detection starts here!
Can you detect fraudulent click traffic for mobile app ads?
5 Toxic Comment Classification Challenge
NLP starts here!
Identify and classify toxic online comments
6 Santander Customer Satisfaction
HOT
Which customers are happy customers?
7 2018 Data Science Bowl
CV
Find the nuclei in divergent images to advance medical discovery
8 Bike Sharing Demand
Forecasting
Forecast use of a city bikeshare system
9 Instacart Market Basket Analysis
Product selection analysis
Which products will an Instacart consumer purchase again?
10 San Francisco Crime Classification
Multi-class prediction
Predict the category of crimes that occurred in the city by the bay
To download the datasets directly, reply "kaggledata" to the official account.