Resource page for BAEIR Members and Friends

Feel free to suggest useful stuff to be listed here.

Lab Meeting Material

  1. 2019 Spring (restricted access)
  2. 2018 Fall (restricted access)
  3. 2018 Spring (restricted access)
  4. 2017 Fall (restricted access)
  5. 2017 Spring (restricted access)
  6. 2016 Fall (restricted access)
  7. 2016 Spring (restricted access)
  8. 2015 Fall (restricted access)
  9. 2015 Summer Bootcamp (restricted access)
  10. 2015 Spring (restricted access)
  11. 2014 Fall (restricted access)

Important Conferences:

  1. KDD; ACM SigKDD Conference on Knowledge Discovery and Data Mining; http://www.kdd.org/
    • KDD 2013, http://www.kdd.org/kdd2013/, local archive at http://weiwei.lu.im.ntu.edu.tw/kdd2013/kdd2013.htm (username/password: kdd2013/2013kdd)
    • KDD 2012, http://kdd2012.sigkdd.org/, http://weiwei.lu.im.ntu.edu.tw/kdd2012/
  2. NIPS, http://nips.cc/
  3. ICML, http://icml.cc/2015/
  4. ICDM; IEEE International Conference on Data Mining
  5. ICIS 2014, http://icis2014.aisnet.org/

Important Journals (IS oriented):

  1. MIS Quarterly
  2. Information Systems Research
  3. Journal of Management Information Systems
  4. Decision Support Systems

Import Journals (Technical):

  1. ACM Transactions on Information Systems
  2. IEEE Transactions Knowledge and Data Engineering

Real Time Status (cluster, network traffic)

You are welcome to apply for an account on common.lu.im.ntu.edu.tw (Ubuntu 12 LTS)

  1. Lab Torque Cluster Status (originally created by 鈺嫻)
  2. Traffice flow: 資管系

Opensource Tools

  1. R (statistical inference platform), http://www.r-project.org/
    1. Rcpp: a good way to improve the speed of you R code. You may also need RcppArmadillo.
    2. Writing C extension for R: The standard way to to improve the speed of your R code.
  2. Natural language processing and text mining:
    1. OpenNLP (https://opennlp.apache.org/),
    2. Lingpipe (http://alias-i.com/lingpipe/),
    3. Standford NLP tools (http://nlp.stanford.edu/software/index.shtml)
    4. Mallet (http://mallet.cs.umass.edu/)
    5. NLTK (http://www.nltk.org/), a Python library

Dataset: Public

  1. UCI Machine Learning Repository
  2. Kaggle
  3. Open Government Data
  4. KDnugget
  5. StatLib
  6. University of Edinburgh
  7. KDD Cup (1997 - 2010); use Google to locate newer KDD CUP datasets.
  8. Enron Email Dataset
  9. MovieLens
  10. 467 Million Twitter tweets; also see the local dataset
  11. Wikipedia Downloads
  12. Wikipedia pagecount; also see the local dataset
  13. WRDS: Accounting data, stock return, high frequency trading, analysts forecasts (you need to apply for an account through the college); also see the local dataset

Dataset: Local/Private (restricted access)

Museum for Past Master Theses (restricted access)