Data Scientist Resources 2013

Big Data is still a hot talk for business intelligence and data mining people in 2013. In my previous post, i have researched about big data search trends in 2012. FYI, data scientist is a practitioner of data science. Below are top 10 resources to kick start becoming a data scientist by exploring big data online resources:

Data Science Central – the industry’s online resource for big data practitioners, including information about the latest in technology, tools and trends.
How to be a Data Scientist – article in Smart Data Collective describing set of skills you should have if you want to do data science.
Free Big Data Education – article in Big Data Republic that listed free online courses (MOOC) which you can take toward obtaining the requisite background for becoming a data scientist.
Data Science Tutorials – list of tutorials by Kaggle to perform data analysis using data scientist’s toolkit.
Data Science News in Social Media – compilation of latest news about data science.
Data Science Wikibooks – open book with a very basic introduction to data science.
CODATA – the International Council for Science (ICSU), which works to improve the quality, reliability, management and accessibility of data. Also resource for Data Science Journal.
GigaOM Big Data – latest big data tech stories.
5 Big Data Predictions for 2013 – some of the key big data themes to dominate 2013.
Top 5 Data Science Bloggers – article that listed top 5 data science blogs.

Continue Reading

26 Keywords for Data Mining?

It is interesting to note that Paul McFedries has described 26 words for data mining in his article at IEEE Spectrum. Below is a list of the words and their links to Wikipedia for your reading.

  1. knowledge engineers
  2. data preprocessing
  3. data warehouse
  4. data mart
  5. data cleansing
  6. dirty data
  7. noise
  8. knowledge discovery
  9. diapers and beer (seriously!)
  10. pattern mining
  11. association rules
  12. text mining
  13. audio mining
  14. audio indexing
  15. image mining
  16. video mining
  17. spatial mining
  18. geospatial mining
  19. crowd mining
  20. web mining
  21. data dredging
  22. data fishing
  23. data snooping
  24. automated data mining
  25. ??
  26. ??
Continue Reading

Data Mining News

MPs rush to sign up to legal data mining service – Computing

Computing

MPs rush to sign up to legal data mining service
Computing
Dods Legislation allows users to mine unstructured data from the European Commission and the European Parliament in order to track clauses or entire bills as they are updated or changed. The service, which costs between £6000 and £16000 per person, …
NASA Chat to Show Data Mining’s Safety Value – Occupational Health & Safety

NASA Chat to Show Data Mining’s Safety Value
Occupational Health & Safety
Data mining – analyzing “terabytes of aviation data to find issues before they become incidents,” as NASA explains it – is paying dividends for Southwest Airlines. The airline’s flight safety director, Capt. Jeff Hamlett, will explain how the airline …
Ford Offers F-Series Truck App Software for Towing – Truckinginfo

Ford Offers F-Series Truck App Software for Towing
Truckinginfo
More > Mobile data terminal supplier QSI Corp. released the Treq-M4x mobile data terminal…. More > Vigillo, which makes data mining software products that organize fleet safety information in scorecard format, announced its Affiliate Member Program. …
Googlers Buy More Junk Food Than Microsofties (And Why Rapleaf Is Creepy) – TechCrunch

Googlers Buy More Junk Food Than Microsofties (And Why Rapleaf Is Creepy)
TechCrunch
If you weren’t creeped out by data-mining startup Rapleaf after reading about their ways in a relatively unsettling Wall Street Journal article published last October (“The San Francisco startup says it has 1 billion e-mail addresses in its database”), …
and more »
Drilling Down With Informatica – Seeking Alpha

Drilling Down With Informatica
Seeking Alpha
I’ve been intrigued, not by the stock, but by its sector of data mining ever since I became aware of it a couple of years ago. Data mining stocks haven’t really piqued my interest that much, because the main players are software industry behemoths IBM …
and more »
HHS plans to pay for state data mining to spot Medicaid fraud – Government Health IT

HHS plans to pay for state data mining to spot Medicaid fraud
Government Health IT
The Health and Human Services Department has proposed that state Medicaid agencies be able to use federal funds to help pay for uncovering fraud through the screening and analysis of Medicaid claims data. Data mining is one of the …
HHS Seeks To Allow State Data Mining for Medicaid FraudiHealthBeat
HHS seeks to let states data-mine for fraudModernHealthcare.com
all 4 news articles »
Senate Hearings to Investigate Corporate Data Mining and Individual Privacy – AACRAO Transcript

DailyTech

Senate Hearings to Investigate Corporate Data Mining and Individual Privacy
AACRAO Transcript
On Wednesday, the Senate initiated a series hearings investigating corporate data mining and individual privacy. There is now an enormous multibillion-dollar industry based on the collection and sale of this personal and behavioral data, reports Time …
‘Do Not Track’ Becoming A RealityInternational Business Times
The White House Pushes For More Internet PrivacyThe Atlanta Post
Obama to Push Internet Privacy Bill, Create Online Tracking, Opt-OutDailyTech
all 206 news articles »
Medicaid Data Mining Proposed – GovInfoSecurity.com

GovInfoSecurity.com

Medicaid Data Mining Proposed
GovInfoSecurity.com
To ramp up efforts to detect Medicaid fraud, the Department of Health and Human Services is proposing a rule that would enable states to use federal matching funds to support Medicaid claims data mining. Current law prohibits use of the federal funds …
Rule to Enable Medicaid Data Mining for FraudHealth Data Management
all 3 news articles »
Data Mining: How Companies Now Know Everything About You – TIME

Data Mining: How Companies Now Know Everything About You
TIME
RapLeaf, a data-mining company that was recently banned by Facebook because it mined people’s user IDs, has me down as a 35-to-44-year-old married male with a graduate degree living in LA But RapLeaf thinks I have no kids, work as a medical …
and more »
Big Data Mining: Who Owns Your Social Network Data? – PCWorld

Big Data Mining: Who Owns Your Social Network Data?
PCWorld
Business analysts can study large data sets by renting servers for an hour, using technology such as Hadoop, he says. Mining the social networks’ Big Data Companies such as Echo and Cloudera are seeking their niche in the Big Data and social network …
and more »

For More Information about Data Minining click here

Continue Reading

Top 10 Data Mining Sites for Beginners

Here is my version of top 10 data mining sites for beginners in data mining/business intelligence/analytics. Anyway, welcome to the club 🙂

  1. Data Mining Community’s Top Resource (KDnuggets) – great info for software resources, education, jobs etc.
  2. StatSoft Data Mining Techniques – great info for data mining process, concepts, modeling, data warehousing etc. They even have online Statistics textbook that offers training in the understanding and application of statistics.
  3. Statistical Data Mining Tutorials (AutonLab) – compilation of a set of tutorials on many aspects of statistical data mining, including the foundations of probability, the foundations of statistical data analysis, and most of the classic machine learning and data mining algorithms.
  4. Kurt Thearling Data Mining – a great introduction to data mining foundation, including data mining architecture and glossary terms.
  5. The Data Mine – collaborative input using Twiki about data mining and knowledge discovery including exhaustive list of software/tools (free and commercials).
  6. Videolectures on Data Mining – popular video lecture on recent data mining technologies by data mining people.
  7. Video Tutorials (sentimentmining) – an online data mining video tutorials using WEKA for beginners, including how to perform text mining, clustering, neural network and sentiment mining.
  8. Information Management – an up-to-date publication for business intelligence, analytics, integration and data warehousing.
  9. Data Mining and Knowledge Discovery – premier technical journal focused on the theory, techniques and practice for extracting information from large databases.
  10. Wikipedia on Data Mining – last but not least an online encyclopedia on data mining including, background, process and applications of data mining.
Continue Reading