Difference between revisions of "Computer Vision"

From Ioannis Kourouklides
 
(35 intermediate revisions by 2 users not shown)
Line 1: Line 1:
 
[[File:Reconstruct-3D.jpg|right|230px]]
+
This page contains resources about [http://en.wikipedia.org/wiki/Computer_vision Computer Vision], '''Machine Vision''' and [http://en.wikipedia.org/wiki/Image_processing Image Processing] in general.
  
Line 6: Line 7:
 
==Subfields and Concepts==
''See [http://kourouklides.wikia.com/wiki/Category:Computer_Vision Category:Computer Vision] for some of its subfields.''
+ * Image Preprocessing
+ ** Image Augmentation
* Low-level Vision
** [[Digital Image Processing|Digital Image Processing]]
** Feature extraction
- ** Edge detection
+ *** Hough Transform
+ ** Feature detection
+ *** Edge detection
+ *** Corner detection
** Optical flow
* Intermediate-level Vision
- ** Object recognition
+ ** Recognition tasks
+ *** Face recognition
+ ** Object detection
+ *** Face detection
+ *** Pedestrian detection
+ ** Image segmentation
+ ** Semantic image segmentation
+ ** Image registration
** 3D reconstruction
** Motion analysis
+ ** Texture Analysis and Synthesis
+ *** Co-occurrence Matrix
* High-level Vision / Image Understanding
** [[Machine Learning]]
* Structure from Motion
+ * [[Simultaneous Localization and Mapping]] (SLAM)
+ * 3D point clouds
+ * Optical Character Recognition (OCR)
+ * Place and Object recognition
+ ** Object detection
+ ** Object localization
+ ** Object classification
+ ** Scene classification
+ ** Scene recognition
+ ** Semantic Scene Understanding
+ * Feature descriptors
+ ** Scale-invariant feature transform (SIFT)
+ ** Speeded up robust features (SURF)
+ ** Histogram of oriented gradients (HOG)
* [[Medical Imaging|Medical Image Computing / Medical Image Analysis]]
+ * ([[Algorithm|Combinatorial/Algorithmic]]) Computational Geometry & Discrete Geometry
* [[Computer Graphics]]
+ ** Inverse Graphics
  
 
==Online Courses==
Line 26: Line 57:
 
* [https://www.coursera.org/course/images Image and video processing by Guillermo Sapiro] ([https://www.youtube.com/watch?v=bxhJEe38bhY&list=PLZ9qNFMHZ-A79y1StvUUqgyL-O0fZh2rs YouTube])
* [http://homepages.inf.ed.ac.uk/rbf/AVINVERTED/main_av.htm Advanced Vision by Bob Fisher]
+ * [https://www.coursera.org/learn/practical-rl Deep Learning in Computer Vision] - Coursera
  
 
===Lecture Notes===
Line 53: Line 84:
  
 
===Specialized===
+ * Boissonnat, J. D., Chazal, F., & Yvinec, M. (2018). ''Geometric and Topological Inference''. Cambridge University Press. ([https://geometrica.saclay.inria.fr/team/Fred.Chazal/papers/CGLcourseNotes/main.pdf link])
* Prince, S. J. D. (2012). ''Computer Vision: Models, Learning, and Inference''. Cambridge University Press.
* Nowozin, S., & Lampert, C. H. (2011). ''Structured Learning and Prediction in Computer Vision''. Foundations and Trends in Computer Graphics and Vision, 6(3-4), 185-365.
- * Hyvärinen, A., Hurri, J. & Hoyer, P. O. (2009). ''Natural Image Statistics: A Probabilistic Approach to Early Computational Vision''. Springer.
+ * Hyvarinen, A., Hurri, J. & Hoyer, P. O. (2009). ''Natural Image Statistics: A Probabilistic Approach to Early Computational Vision''. Springer.
- * Ma, Y. (Ed.). (2004). ''An Invitation to 3D Vision: From Images to Geometric Models'' (Vol. 26). Springer.
+ * Ma, Y. (Ed.). (2004). ''An Invitation to 3D Vision: From Images to Geometric Models''. Springer.
  
 
==Software==
Line 68: Line 100:
 
* [http://www.mathworks.co.uk/help/vision/index.html Computer Vision System Toolbox] - MATLAB
* [http://vxl.sourceforge.net/ VXL] - C++
- * [http://processing.org/ Processing] - IDE for Computational Artitsts promoting software literacy within the visual arts
+ * [http://processing.org/ Processing] - IDE for Computational Artists promoting software literacy within the visual arts
+
+ ==Datasets==
+ * [https://project.inria.fr/aerialimagelabeling/ Inria Aerial Image Labeling Dataset]
+ * [http://cvgl.stanford.edu/projects/uav_data/ Stanford Drone Dataset]
+ * [https://deepmind.com/research/open-source/open-source-datasets/kinetics/ Kinetics-400 and Kinetics-600 datasets by DeepMind]
+ * [https://computervisiononline.com/datasets List of datasets by Computer Vision Online]
+ * [http://www.cvpapers.com/datasets.html CV Datasets on the web]
  
 
==See also==
* [[Machine Learning]]
+ * [[Topology]]
  
 
==Other Resources==
Line 81: Line 121:
 
*[http://homepages.inf.ed.ac.uk/rbf/CVonline/ CVonline] - The evolving, distributed, non-proprietary, on-line compendium of Computer Vision
*[http://semicolonproducts.blogspot.co.uk/2014/04/computer-vision-systems-in-nutshell.html Computer Vision systems in a nutshell]
+ *[https://alexgkendall.com/computer_vision/have_we_forgotten_about_geometry_in_computer_vision/ Have We Forgotten about Geometry in Computer Vision?]
+ *[https://towardsdatascience.com/from-tda-to-dl-d06f234f51d From Topological Data Analysis to Deep Learning: No Pain No Gain] - blog post
+ *[http://www.cvlibs.net/datasets/kitti/ KITTI Vision Benchmark Suite]
  
 
[[Category:Computer_Vision]]

Latest revision as of 23:46, 14 January 2019

This page contains resources about Computer Vision, Machine Vision and Image Processing in general.

More specific information is included in each subfield.

Subfields and Concepts

See Category:Computer Vision for some of its subfields.

  • Image Preprocessing
    • Image Augmentation
  • Low-level Vision
    • Digital Image Processing
    • Feature extraction
      • Hough Transform
    • Feature detection
      • Edge detection
      • Corner detection
    • Optical flow
  • Intermediate-level Vision
    • Recognition tasks
      • Face recognition
    • Object detection
      • Face detection
      • Pedestrian detection
    • Image segmentation
    • Semantic image segmentation
    • Image registration
    • 3D reconstruction
    • Motion analysis
    • Texture Analysis and Synthesis
      • Co-occurrence Matrix
  • High-level Vision / Image Understanding
    • Machine Learning
  • Structure from Motion
  • Simultaneous Localization and Mapping (SLAM)
  • 3D point clouds
  • Optical Character Recognition (OCR)
  • Place and Object recognition
    • Object detection
    • Object localization
    • Object classification
    • Scene classification
    • Scene recognition
    • Semantic Scene Understanding
  • Feature descriptors
    • Scale-invariant feature transform (SIFT)
    • Speeded up robust features (SURF)
    • Histogram of oriented gradients (HOG)
  • Medical Image Computing / Medical Image Analysis
  • (Combinatorial/Algorithmic) Computational Geometry & Discrete Geometry
  • Computer Graphics
    • Inverse Graphics

Online Courses

Video Lectures

Lecture Notes

Books

Practical

  • Howse, J. (2013). OpenCV Computer Vision with Python. Packt Publishing Ltd.
  • Demaagd, K., Oliver, A., Oostendorp, N., & Scott, K. (2012). Practical Computer Vision with SimpleCV: The Simple Way to Make Technology See. O'Reilly Media, Inc.
  • Solem, J. E. (2012). Programming Computer Vision with Python: Tools and algorithms for analyzing images. O'Reilly Media, Inc.
  • Bradski, G., & Kaehler, A. (2008). Learning OpenCV: Computer Vision with the OpenCV Library. O'Reilly Media, Inc.

Introductory

  • Szeliski, R. (2010). Computer Vision: Algorithms and Applications. Springer.
  • Hartley, R., & Zisserman, A. (2004). Multiple View Geometry in Computer Vision. Cambridge University Press.
  • Forsyth, D. A., & Ponce, J. (2002). Computer Vision: A Modern Approach. Prentice Hall.

Advanced

  • Jähne, B., Geissler, P., & Haussecker, H. (1999). Handbook of Computer Vision and Applications with CD-ROM. Morgan Kaufmann Publishers Inc.

Specialized

  • Boissonnat, J. D., Chazal, F., & Yvinec, M. (2018). Geometric and Topological Inference. Cambridge University Press. (link)
  • Prince, S. J. D. (2012). Computer Vision: Models, Learning, and Inference. Cambridge University Press.
  • Nowozin, S., & Lampert, C. H. (2011). Structured Learning and Prediction in Computer Vision. Foundations and Trends in Computer Graphics and Vision, 6(3-4), 185-365.
  • Hyvärinen, A., Hurri, J. & Hoyer, P. O. (2009). Natural Image Statistics: A Probabilistic Approach to Early Computational Vision. Springer.
  • Ma, Y. (Ed.). (2004). An Invitation to 3D Vision: From Images to Geometric Models. Springer.

Software

Datasets

See also

Other Resources