[1] Ren, Shaoqing, et al. "Faster R-CNN: Towards Real-time Object Detection with Region Proposal Networks." Advances in neural information processing systems. 2015.
[2] Zhou, Bolei, et al. "Places2: A Large-scale Database for Scene Understanding." Arxiv preprint:[pending] (2015).
[3] Szegedy, Christian, et al. "Going Deeper with Convolutions." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2015.
[4] Ioffe, Sergey, and Christian Szegedy. "Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift." ArXiv preprint arXiv:1502.03167 (2015).
[5] Szegedy, Christian, et al. "Rethinking the Inception Architecture for Computer Vision." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2016.
[6] Szegedy, Christian, et al. "Inception-v4, Inception-Resnet and the Impact of Residual Connections on Learning." ArXiv preprint arXiv:1602.07261 (2016).
[7] Lin, Min, Qiang Chen, and Shuicheng Yan. "Network in Network." ArXiv preprint arXiv:1312.4400 (2013).
[8] https://www.tensorflow.org/tutorials/image_retraining
[9] https://github.com/metalbubble/places_devkit
[10] https://docs.google.com/presentation/d/1_wdSh2PFxiqBegt5PcatbEiQaganlgdb5bH7V2jHXZI/mobilepresent?slide=id.p