VS298: Unsolved Problems in Vision: Difference between revisions

From RedwoodCenter
Jump to navigationJump to search
No edit summary
No edit summary
 
(72 intermediate revisions by the same user not shown)
Line 10: Line 10:


'''Meeting time and place''':
'''Meeting time and place''':
 
Tuesday 6-8, 560 Evans (Redwood Center conference room)
Tuesday 6-8, 489 Minor


'''Email list''':
'''Email list''':


vs298-unsolved-problems@lists.berkeley.edu [https://calmail.berkeley.edu/manage/list/ subscribe]
*Seminar mailing list vs298_unsolved_problems_in_vision@lists.berkeley.edu [https://calmail.berkeley.edu/manage/list/listinfo_subscribe/vs298_unsolved_problems_in_vision@lists.berkeley.edu subscribe]
 
*Lecture series mailing list [https://calmail.berkeley.edu/manage/list/listinfo_subscribe/unsolved-problems@lists.berkeley.edu subscribe]


'''Weekly schedule''':
'''Weekly schedule''':
Line 41: Line 40:
|-  
|-  
| Sept. 16
| Sept. 16
| Evening seminar, focus on student projects: form groups and discuss proposal topics
| Evening seminar, focus on student projects: form groups and discuss proposal topics [https://www.dropbox.com/s/2onrsi6n6338bjb/4Dec2014%20Grant%20PDF.pdf?dl=0 grant_format]
|-
|-
| Sept. 19 <br />
| Sept. 19 <br />
(Friday) 11:00 a.m., 5101 Tolman
(Friday) 11:00 a.m., 5101 Tolman
| <br /> '''Gary Marcus lecture: Computational diversity and the mesoscale organization of the neocortex'''
| <br /> '''Gary Marcus lecture: Computational diversity and the mesoscale organization of the neocortex''' [https://archive.org/details/Redwood_Center_2014_09_19_Gary_Marcus video]
<br />
<br />
The human neocortex participates in a wide range of tasks, yet superficially appears to adhere to a relatively uniform six-layered architecture throughout its extent. For that reason, much research has been devoted to characterizing a single "canonical" cortical computation, repeated massively throughout the cortex, with differences between areas presumed to arise from their inputs and outputs rather than from “intrinsic” properties. There is as yet no consensus, however, about what such a canonical computation might be, little evidence that uniform systems can capture abstract and symbolic computation (e.g., language) and little contact between proposals for a single canonical circuit and complexities such as differential gene expression across cortex, or the diversity of neurons and synapse types. Here, we evaluate and synthesize diverse evidence for a different way of thinking about neocortical architecture, which we believe to be more compatible with evolutionary and developmental biology, as well as with the inherent diversity of cortical functions. In this conception, the cortex is composed of an array of reconfigurable computational blocks, each capable of performing a variety of distinct operations, and possibly evolved through duplication and divergence. The computation performed by each block depends on its internal configuration. Area-specific specialization arises as a function of differing configurations of the local logic blocks, area-specific long-range axonal projection patterns and area-specific properties of the input. This view provides a possible framework for integrating detailed knowledge of cortical microcircuitry with computational characterizations. With Adam Marblestone, MIT and Tom Dean, Google
|-  
|-  
| Sept. 23
| Sept. 23
|  Marcus discussion <br />
|  Marcus discussion (postponed) <br />
Feldman background discussion<br />
Feldman background discussion<br />
*NEW MATERIALs:
* Feldman, J. (2008). From molecule to metaphor: A neural theory of language. MIT press. (look at Chapters 1, 2, 9, and 26 before class) [http://site.ebrary.com/lib/berkeley/detail.action?docID=10173587 pdf]
*OLD MATERIALS:
*Feldman, J. (2013). The neural binding problem (s). Cognitive neurodynamics, 7(1), 1-11. [ftp://ftp.icsi.berkeley.edu/pub/feldman/binding.cody.pdf pdf]
*Feldman, J. & Narayanan, S. (2014). Affordances, Actionability, and Simulation. Affordances Workshop, Robotics Science and Systems 2014, Berkeley, CA [ftp://ftp.icsi.berkeley.edu/pub/feldman/affordances.jf.pdf pdf]
* Feldman, J. (2010). Ecological expected utility and the mythical neural code. Cognitive neurodynamics, 4(1), 25-35. [ftp://ftp.icsi.berkeley.edu/pub/feldman/eeu.pdf pdf]
* F. T. Sommer: Neural oscillatons and synchrony as mechanisms for coding, communication and computation in the visual system. Chapter in: The New Visual Neurosciences, Eds.: Leo M. Chalupa and John S. Werner, MIT Press (2014) [http://mitpress.mit.edu/sites/default/files/titles/content/9780262019163_toc_0001.pdf pdf_contents]
|-
|-
| Sept. 30
| Sept. 30
| <br />'''Feldman lecture''' (tentative)
|  Discuss student projects<br />
|-
| Wed. Oct. 1, 4:15 p.m., 489 Minor Hall
| <br />'''Feldman lecture: The neural binding problem(s) and related mysteries''' [https://archive.org/details/uc_berkeley_vs298_2014_10_01_Jerry_Feldman video]
As with many other “problems” in vision and cognitive science, “the binding problem” has been used to label a wide range of tasks of radically different behavioral and computational structure. These include a “hard” version that is currently intractable, a feature-binding variant that is productive routine science and a variable-binding case that is unsolved, but should be solvable. The talk will cover all these and some related problems that seem intractably hard as well as some that are unsolved, but are being approached with current and planned experiments.
<br />
<br />
|-
|-
Line 59: Line 70:
| Feldman discussion <br />
| Feldman discussion <br />
Malik background discussion<br />
Malik background discussion<br />
* Malik papers...
* R. Girshick, J. Donahue, T. Darrell and J. Malik, "Rich feature hierarchies for accurate object detection and semantic segmentation". Proc. of CVPR 2014. [http://www.cs.berkeley.edu/~rbg/papers/r-cnn-cvpr.pdf pdf]
 
*P. Arbelaez, J. Pont-Tuset,  J. Barron, F. Marques and J. Malik, "Multiscale Combinatorial Grouping", Proc. of CVPR 2014. [http://www.eecs.berkeley.edu/Research/Projects/CS/vision/grouping/mcg/ pdf]
 
*B. Hariharan, P. Arbelaez, R. Girshick and J. Malik, Simultaneous Detection and Segmentation. ECCV (7) 2014: 297-312. [http://www.cs.berkeley.edu/~bharath2/pubs/pdfs/BharathECCV2014.pdf pdf]
 
*S. Gupta, R. Girshick, P. Arbelaez, J. Malik: Learning Rich Features from RGB-D Images for Object Detection and Segmentation. ECCV (7) 2014: 345-360. [http://www.cs.berkeley.edu/~sgupta/pdf/rcnn-depth.pdf pdf]
|-
|Tue. Oct. 14, 6 to 8 p.m. 560 Evans
| <br />'''Jitendra Malik lecture: The Three R's of Computer Vision: Recognition, Reconstruction and Reorganization''' [https://archive.org/details/ucb_vs298_2014_10_14_Jitendra_Malik video]
<br />
|-
|-
| Oct. 14
| Oct. 20, 12-1:30 p.m., Minor 489
| <br />'''Jitendra Malik lecture''' (tentative date)
| <br />'''Harold Bedell lecture: Contour interaction: as far from the muddling crowd?''' [https://archive.org/details/ucb_vs298_2014_10_20_Harold_Bedell video]
Contour interaction describes the interference with target recognition that occurs in the presence of nearby flanking edges. As one of the pioneers in this research, Flom distinguished between contour interaction and crowding, in which contributions to spatial interference can derive also from additional factors, such as inaccurate eye movements and attentional processes. In the normal fovea, contour interaction and crowding have a similar magnitude and operate over a similar spatial extent. Both foveal contour interaction and crowding are reduced when the luminance of the stimulus is decreased. Unlike the fovea, the magnitude and extent of contour interaction in peripheral vision are considerably more limited than crowding. Further, peripheral contour interaction and crowding are not affected substantively by target luminance. Indeed, the magnitude and extent of peripheral contour interaction are similar for photopic and scotopic targets. These results suggest that the contributions of specific mechanisms may differ for foveal and peripheral contour interaction and crowding.
<br />
<br />
Readings:
*Siderov, J., Waugh, S. J., & Bedell, H. E. (2013). Foveal contour interaction for low contrast acuity targets. Vision research, 77, 10-13. [https://www.dropbox.com/s/chxuluvmy1hrumh/Siderov_Bedell_Waugh2013.pdf?dl=0 pdf]
*Coates, D. R., & Levi, D. M. (2014). Contour interaction in foveal vision: A response to. Vision research, 96, 140-144. [https://www.dropbox.com/s/7tycw0m71ocw4cw/Coates%20Levi%20Ltr%20VR_14.pdf?dl=0 pdf]
*Siderov, J., Waugh, S. J., & Bedell, H. E. (2014). Foveal contour interaction on the edge: Response to ‘Letter-to-the-Editor’by Drs. Coates and Levi. Vision research, 96, 145-148. [https://www.dropbox.com/s/2so6z5yxrsgs5iy/Siderov%20etal%20CI%20on%20Edge%20Reply%20VR_14.pdf?dl=0 pdf]
|-
|-
| Oct. 21
| Oct. 21
| Malik discussion <br />
| Malik discussion <br />
Nakayama background discussion<br />
Nakayama and Shimojo background discussion<br />
* Nakayama, K. (1999). Mid-level vision. In R. A. Wilson & F. C. Keil (Eds.), The MIT encylopedia of the cognitive sciences Cambridge: MIT Press  [http://visionlab.harvard.edu/members/ken/Papers/100mitencyclopedia99.pdf pdf]
* Nakayama, K. (1999). Mid-level vision. In R. A. Wilson & F. C. Keil (Eds.), The MIT encylopedia of the cognitive sciences Cambridge: MIT Press  [http://visionlab.harvard.edu/members/ken/Papers/100mitencyclopedia99.pdf pdf]
* Nakayama, K. (2010)  "Vision going social." The science of social vision. Adams, R.B. Jr., Ambady, N., Nakayama, K. & Shimojo, S. (Eds) Oxford University Press [http://visionlab.harvard.edu/members/ken/Papers/160NakayamaVisionGoingSocial.pdf pdf]
* Nakayama, K. (2010)  "Vision going social." The science of social vision. Adams, R.B. Jr., Ambady, N., Nakayama, K. & Shimojo, S. (Eds) Oxford University Press [http://visionlab.harvard.edu/members/ken/Papers/160NakayamaVisionGoingSocial.pdf pdf]
* Nakayama, K. and Martini, P. (2011) Situating Visual Search. Vision Research, 51, 1526-1537. [http://visionlab.harvard.edu/members/ken/Papers/166NakayamaMartiniSVS2011.pdf pdf]
* Nakayama, K. and Martini, P. (2011) Situating Visual Search. Vision Research, 51, 1526-1537. [http://visionlab.harvard.edu/members/ken/Papers/166NakayamaMartiniSVS2011.pdf pdf]
(All Nakayama pubs available [http://visionlab.harvard.edu/members/ken/nakayamapub.htm here])
(All Nakayama pubs available [http://visionlab.harvard.edu/members/ken/nakayamapub.htm here])
*Shimojo, S. (2014). Postdiction: its implications on visual awareness, hindsight, and sense of agency. Frontiers in psychology, 5. [http://journal.frontiersin.org/Journal/10.3389/fpsyg.2014.00196/full pdf]
|-
|-
| Oct. 28
| Oct. 28, 6-8 p.m., 560 Evans
| <br />'''Ken Nakayama lecture'''
| <br />'''Ken Nakayama lecture: The scientist’s choice: solving, explaining, discovering . . . . '''
<br />
<br />
|-
|-
| Nov. 3 (Monday) 12:30 p.m. 489 Minor Hall
| Nov. 3 (Monday)
| <br />'''Shinsuke Shimojo lecture: Contour Interaction: as far from the muddling crowd?'''
12:00 p.m.
489 Minor Hall
| <br />'''Shinsuke Shimojo lecture: Postdiction: its implications on visual awareness, hindsight, and sense of agency''' [https://archive.org/details/ucb_vs298_2014_11_03_Shinsuke_Shimojo video]
<br />
|- |-
| Nov. 3 (Monday)
3:30-4:30 p.m.
560 Evans Hall
| <br />Discussion with Shinshuke Shimojo
<br />
<br />
|-  
|-  
Line 85: Line 121:
Wandell background discussion<br />
Wandell background discussion<br />
* To appear: Computational modeling of responses in human visual cortex. BA Wandell, J Winawer, KN Kay.
* To appear: Computational modeling of responses in human visual cortex. BA Wandell, J Winawer, KN Kay.
In Brain Mapping:  An Encyclopedic Reference (Edited by Thompson and Friston.)
In Brain Mapping:  An Encyclopedic Reference (Edited by Thompson and Friston.) [http://white.stanford.edu/~brian/papers/mri/2014-WandellWinawerKay-CorticalModeling-Encyclopedia.pdf pdf]
|-
|-
| (Friday) Nov. 14
| (Friday) Nov. 14 11 a.m., 560 Evans Hall
| <br />'''Brian Wandell lecture'''
| <br />'''Brian Wandell lecture'''
<br />
<br />
|-
|-
| Nov. 18
| Nov. 18
| Wandell discussion <br />
| Consciousness discussion <br />
Gallant background discussion<br />
 
* Gallant papers...
|-
|-
| Nov. 25
| Nov. 25
|| <br /> '''Jack Gallant lecture'''
|| <br />
<br />
Evening seminar: Gallant Discussion<br />
Koch background discussion<br />
Koch background discussion<br />
*Koch, C. Project MindScope [https://www.dropbox.com/s/zslgzwyu9pey1mz/Future%20of%20the%20Brain%20-%20MindScope%20%2715.pdf?dl=0 pdf]
*Tsuchiya, N., & Koch, C. (2008). The relationship between consciousness and attention. The neurology of consciousness: Cognitive neuroscience and neuropathology, 63-78. [https://www.dropbox.com/s/huehdgxd77jzjha/Tsuchiya%20%26%20Koch%20%2714.pdf?dl=0 pdf]
*Klein, S. A. (1993). Will robots see? Chapter in Spatial Vision in Humans and Robots, Cambridge University Press, 184-199. [https://www.dropbox.com/s/0u15k6yf9o8zrvo/Klein%20Will%20Robots%20See.pdf?dl=0 pdf]
*Tononi, G., & Koch, C. (2014). Consciousness: Here, There but Not Everywhere. arXiv preprint arXiv:1405.7089. [http://arxiv.org/pdf/1405.7089.pdf pdf]
*Tononi, G., & Koch, C. (2014). Consciousness: Here, There but Not Everywhere. arXiv preprint arXiv:1405.7089. [http://arxiv.org/pdf/1405.7089.pdf pdf]
*Scientific American [http://blogs.scientificamerican.com/cross-check/2012/04/02/christof-koch-on-free-will-the-singularity-and-the-quest-to-crack-consciousness/ article]
*Scientific American [http://blogs.scientificamerican.com/cross-check/2012/04/02/christof-koch-on-free-will-the-singularity-and-the-quest-to-crack-consciousness/ article]
*Scientific American [http://www.scientificamerican.com/article/consciousness-does-not-reside-here/ article]
*Scientific American [http://www.scientificamerican.com/article/consciousness-does-not-reside-here/ article]
|-
|-
| Dec. 2
| Dec. 2, 4-6 p.m., 125 Li Ka Shing
| <br />'''Christof Koch lecture: Unsolved Problems in Vision: Consciousness.'''
| <br />'''Christof Koch lecture: Unsolved Problems in Vision: Consciousness.'''
Evening seminar: Koch Discussion<br /><br />
Evening seminar: Koch Discussion<br /><br />
|}
|}
<br />
<br />
'''Additional Readings'''
'''Additional Materials'''
*recent special issue of CurrOpinNeuro [http://www.sciencedirect.com/science/journal/09594388/25/supp/C journal]
*recent special issue of CurrOpinNeuro [http://www.sciencedirect.com/science/journal/09594388/25/supp/C journal]
*Rao, R. P., Olshausen, B. A., & Lewicki, M. S. (2002). Probabilistic models of the brain: Perception and neural function. Mit Press. [http://redwood.berkeley.edu/bruno/papers/sparse-spikes.pdf pdf]
*Olshausen BA Olshausen (2013) Perception as an Inference Problem. [http://redwood.berkeley.edu/bruno/papers/perception-as-inference.pdf pdf]
*Olshausen BA (2012)  20 years of learning about vision: Questions answered, questions unanswered, and questions not yet asked. In: 20 Years of Computational Neuroscience (Symposium of the CNS 2010 annual meeting) [http://redwood.berkeley.edu/bruno/papers/CNS2010-chapter.pdf pdf]
*Kitaoka, A (2014) Color-dependent motion illusions in stationary images and their phenomenal dimorphism. Perception advance online publication [http://www.perceptionweb.com/openaccess/p7706.pdf pdf]
*O'Regan, J. K., & Noë, A. (2001). A sensorimotor account of vision and visual consciousness. Behavioral and brain sciences, 24(05), 939-973.[https://www.dropbox.com/s/tbih5x5vo1o5en7/2001%20A%20sensorimotor%20account%20of%20vision%20and%20visual%20consciousness.pdf?dl=0 pdf]
*Bruno Olshausen lecture (1 July 2014) 20 Years of Learning About Vision:  Questions Answered, Questions Unanswered, and Questions Not Yet Asked  [https://archive.org/details/ucb_vision_science_2014_07_01_Bruno_Olshausen video]
*Solari, S. V. H., & Stoner, R. (2011). Cognitive consilience: primate non-primary neuroanatomical circuits underlying cognition. Frontiers in neuroanatomy, 5. [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3243081/pdf/fnana-05-00065.pdf pdf]
*Dyson, Freeman. The Case for Blunders. The New York Review of Books, 6 March 2014. [http://www.nybooks.com/articles/archives/2014/mar/06/darwin-einstein-case-for-blunders/ pdf]
*Machine-Learning Maestro Michael Jordan on the Delusions of Big Data and Other Huge Engineering Efforts,  20 Oct 2014, Lee Gomes, IEEE Specturm [http://spectrum.ieee.org/robotics/artificial-intelligence/machinelearning-maestro-michael-jordan-on-the-delusions-of-big-data-and-other-huge-engineering-efforts link]
*Yann LeCunn responds to Mike Jordan's Spectrum interview [https://www.facebook.com/yann.lecun/posts/10152348155137143 link]
*Kravitz, D. J., Saleem, K. S., Baker, C. I., Ungerleider, L. G., & Mishkin, M. (2013). The ventral visual pathway: an expanded neural framework for the processing of object quality. Trends in cognitive sciences, 17(1), 26-49. [http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3532569/pdf/nihms419686.pdf pdf]
* Vinyals, O. et al. Show and Tell: A Neural Image Caption Generator. 2014 arXiv.1411.4555v1 [http://arxiv.org/pdf/1411.4555v1.pdf pdf]
*Koch, C., & Tononi, G. (2011). A test for consciousness. Scientific American, 304(6), 44-47. [pdf https://www.dropbox.com/s/h2bo3swrjr1g1l1/A_Test_for_Consciousness.pdf?dl=0]

Latest revision as of 20:35, 4 December 2014

One of the goals of vision science is to understand the nature of perception and its neural substrates. There are now many well established techniques and paradigms in both psychophysics and neuroscience to address problems in vision. However, knowing how to frame these questions for investigation is not necessarily obvious. Nervous systems present us with stunning complexity, and the purpose of perception itself is deeply mysterious. The goal of this seminar course is to step back and ask, what are the important problems that remain unsolved in vision research, and how should these be approached empirically? The course will consist of alternating weeks of discussion and guest lectures by vision scientists who will frame their views of the core unsolved problems. Interdisciplinary groups of students will devise a practical research plan to address an unsolved problem of their choice.

Instructors: Stan Klein, Jerry Feldman, Bruno Olshausen, and Karl Zipser
GSI: Dan Coates

Enrollment information:

VS 298 (section 2), 2 units
CCN: 66478

Meeting time and place: Tuesday 6-8, 560 Evans (Redwood Center conference room)

Email list:

  • Seminar mailing list vs298_unsolved_problems_in_vision@lists.berkeley.edu subscribe
  • Lecture series mailing list subscribe

Weekly schedule:

Date Topic/Reading
Sept. 2 Introduction
Sept. 9 Methodology in vision science (Stan Klein)
  • Double-judgment psychophysics: problems and solutions pdf Read pp 1560-1567 This will give a glimpse into some of the issues involved with the relationship between detecting and identifying an object. The second part of the paper is more complicated.
  • Measuring, estimating, and understanding the psychometric function: A commentary pdf I (Stan Klein) was an editor of a special issue of "Perception & Psychophyics" and I wrote the summary article not only commenting on a number of the articles, but also trying to clarify some misunderstood aspects in the field.
  • Psychophysics : A Practical Introduction site This is the text by Kingdom and Prins that I've used when teaching psychophysics methods. I suggest reading Chapters 2 & 3. Some of the dichotomies in Chapter 2 are directly relevant to a number of unsolved problems in vision. Some might even be insoluble.

Marcus background discussion

Sept. 16 Evening seminar, focus on student projects: form groups and discuss proposal topics grant_format
Sept. 19

(Friday) 11:00 a.m., 5101 Tolman


Gary Marcus lecture: Computational diversity and the mesoscale organization of the neocortex video


The human neocortex participates in a wide range of tasks, yet superficially appears to adhere to a relatively uniform six-layered architecture throughout its extent. For that reason, much research has been devoted to characterizing a single "canonical" cortical computation, repeated massively throughout the cortex, with differences between areas presumed to arise from their inputs and outputs rather than from “intrinsic” properties. There is as yet no consensus, however, about what such a canonical computation might be, little evidence that uniform systems can capture abstract and symbolic computation (e.g., language) and little contact between proposals for a single canonical circuit and complexities such as differential gene expression across cortex, or the diversity of neurons and synapse types. Here, we evaluate and synthesize diverse evidence for a different way of thinking about neocortical architecture, which we believe to be more compatible with evolutionary and developmental biology, as well as with the inherent diversity of cortical functions. In this conception, the cortex is composed of an array of reconfigurable computational blocks, each capable of performing a variety of distinct operations, and possibly evolved through duplication and divergence. The computation performed by each block depends on its internal configuration. Area-specific specialization arises as a function of differing configurations of the local logic blocks, area-specific long-range axonal projection patterns and area-specific properties of the input. This view provides a possible framework for integrating detailed knowledge of cortical microcircuitry with computational characterizations. With Adam Marblestone, MIT and Tom Dean, Google

Sept. 23 Marcus discussion (postponed)

Feldman background discussion

  • NEW MATERIALs:
  • Feldman, J. (2008). From molecule to metaphor: A neural theory of language. MIT press. (look at Chapters 1, 2, 9, and 26 before class) pdf
  • OLD MATERIALS:
  • Feldman, J. (2013). The neural binding problem (s). Cognitive neurodynamics, 7(1), 1-11. pdf
  • Feldman, J. & Narayanan, S. (2014). Affordances, Actionability, and Simulation. Affordances Workshop, Robotics Science and Systems 2014, Berkeley, CA pdf
  • Feldman, J. (2010). Ecological expected utility and the mythical neural code. Cognitive neurodynamics, 4(1), 25-35. pdf
  • F. T. Sommer: Neural oscillatons and synchrony as mechanisms for coding, communication and computation in the visual system. Chapter in: The New Visual Neurosciences, Eds.: Leo M. Chalupa and John S. Werner, MIT Press (2014) pdf_contents
Sept. 30 Discuss student projects
Wed. Oct. 1, 4:15 p.m., 489 Minor Hall
Feldman lecture: The neural binding problem(s) and related mysteries video

As with many other “problems” in vision and cognitive science, “the binding problem” has been used to label a wide range of tasks of radically different behavioral and computational structure. These include a “hard” version that is currently intractable, a feature-binding variant that is productive routine science and a variable-binding case that is unsolved, but should be solvable. The talk will cover all these and some related problems that seem intractably hard as well as some that are unsolved, but are being approached with current and planned experiments.

Oct. 7 Feldman discussion

Malik background discussion

  • R. Girshick, J. Donahue, T. Darrell and J. Malik, "Rich feature hierarchies for accurate object detection and semantic segmentation". Proc. of CVPR 2014. pdf
  • P. Arbelaez, J. Pont-Tuset, J. Barron, F. Marques and J. Malik, "Multiscale Combinatorial Grouping", Proc. of CVPR 2014. pdf
  • B. Hariharan, P. Arbelaez, R. Girshick and J. Malik, Simultaneous Detection and Segmentation. ECCV (7) 2014: 297-312. pdf
  • S. Gupta, R. Girshick, P. Arbelaez, J. Malik: Learning Rich Features from RGB-D Images for Object Detection and Segmentation. ECCV (7) 2014: 345-360. pdf
Tue. Oct. 14, 6 to 8 p.m. 560 Evans
Jitendra Malik lecture: The Three R's of Computer Vision: Recognition, Reconstruction and Reorganization video


Oct. 20, 12-1:30 p.m., Minor 489
Harold Bedell lecture: Contour interaction: as far from the muddling crowd? video

Contour interaction describes the interference with target recognition that occurs in the presence of nearby flanking edges. As one of the pioneers in this research, Flom distinguished between contour interaction and crowding, in which contributions to spatial interference can derive also from additional factors, such as inaccurate eye movements and attentional processes. In the normal fovea, contour interaction and crowding have a similar magnitude and operate over a similar spatial extent. Both foveal contour interaction and crowding are reduced when the luminance of the stimulus is decreased. Unlike the fovea, the magnitude and extent of contour interaction in peripheral vision are considerably more limited than crowding. Further, peripheral contour interaction and crowding are not affected substantively by target luminance. Indeed, the magnitude and extent of peripheral contour interaction are similar for photopic and scotopic targets. These results suggest that the contributions of specific mechanisms may differ for foveal and peripheral contour interaction and crowding.
Readings:

  • Siderov, J., Waugh, S. J., & Bedell, H. E. (2013). Foveal contour interaction for low contrast acuity targets. Vision research, 77, 10-13. pdf
  • Coates, D. R., & Levi, D. M. (2014). Contour interaction in foveal vision: A response to. Vision research, 96, 140-144. pdf
  • Siderov, J., Waugh, S. J., & Bedell, H. E. (2014). Foveal contour interaction on the edge: Response to ‘Letter-to-the-Editor’by Drs. Coates and Levi. Vision research, 96, 145-148. pdf
Oct. 21 Malik discussion

Nakayama and Shimojo background discussion

  • Nakayama, K. (1999). Mid-level vision. In R. A. Wilson & F. C. Keil (Eds.), The MIT encylopedia of the cognitive sciences Cambridge: MIT Press pdf
  • Nakayama, K. (2010) "Vision going social." The science of social vision. Adams, R.B. Jr., Ambady, N., Nakayama, K. & Shimojo, S. (Eds) Oxford University Press pdf
  • Nakayama, K. and Martini, P. (2011) Situating Visual Search. Vision Research, 51, 1526-1537. pdf

(All Nakayama pubs available here)

  • Shimojo, S. (2014). Postdiction: its implications on visual awareness, hindsight, and sense of agency. Frontiers in psychology, 5. pdf
Oct. 28, 6-8 p.m., 560 Evans
Ken Nakayama lecture: The scientist’s choice: solving, explaining, discovering . . . .


Nov. 3 (Monday)

12:00 p.m. 489 Minor Hall


Shinsuke Shimojo lecture: Postdiction: its implications on visual awareness, hindsight, and sense of agency video


Nov. 3 (Monday)

3:30-4:30 p.m. 560 Evans Hall


Discussion with Shinshuke Shimojo


Nov. 4 Nakayama and Shimojo discussion

Wandell background discussion

  • To appear: Computational modeling of responses in human visual cortex. BA Wandell, J Winawer, KN Kay.

In Brain Mapping: An Encyclopedic Reference (Edited by Thompson and Friston.) pdf

(Friday) Nov. 14 11 a.m., 560 Evans Hall
Brian Wandell lecture


Nov. 18 Consciousness discussion
Nov. 25

Koch background discussion

  • Koch, C. Project MindScope pdf
  • Tsuchiya, N., & Koch, C. (2008). The relationship between consciousness and attention. The neurology of consciousness: Cognitive neuroscience and neuropathology, 63-78. pdf
  • Klein, S. A. (1993). Will robots see? Chapter in Spatial Vision in Humans and Robots, Cambridge University Press, 184-199. pdf
  • Tononi, G., & Koch, C. (2014). Consciousness: Here, There but Not Everywhere. arXiv preprint arXiv:1405.7089. pdf
  • Scientific American article
  • Scientific American article
Dec. 2, 4-6 p.m., 125 Li Ka Shing
Christof Koch lecture: Unsolved Problems in Vision: Consciousness.

Evening seminar: Koch Discussion


Additional Materials

  • recent special issue of CurrOpinNeuro journal
  • Olshausen BA Olshausen (2013) Perception as an Inference Problem. pdf
  • Olshausen BA (2012) 20 years of learning about vision: Questions answered, questions unanswered, and questions not yet asked. In: 20 Years of Computational Neuroscience (Symposium of the CNS 2010 annual meeting) pdf
  • Kitaoka, A (2014) Color-dependent motion illusions in stationary images and their phenomenal dimorphism. Perception advance online publication pdf
  • O'Regan, J. K., & Noë, A. (2001). A sensorimotor account of vision and visual consciousness. Behavioral and brain sciences, 24(05), 939-973.pdf
  • Bruno Olshausen lecture (1 July 2014) 20 Years of Learning About Vision: Questions Answered, Questions Unanswered, and Questions Not Yet Asked video
  • Solari, S. V. H., & Stoner, R. (2011). Cognitive consilience: primate non-primary neuroanatomical circuits underlying cognition. Frontiers in neuroanatomy, 5. pdf
  • Dyson, Freeman. The Case for Blunders. The New York Review of Books, 6 March 2014. pdf
  • Machine-Learning Maestro Michael Jordan on the Delusions of Big Data and Other Huge Engineering Efforts, 20 Oct 2014, Lee Gomes, IEEE Specturm link
  • Yann LeCunn responds to Mike Jordan's Spectrum interview link
  • Kravitz, D. J., Saleem, K. S., Baker, C. I., Ungerleider, L. G., & Mishkin, M. (2013). The ventral visual pathway: an expanded neural framework for the processing of object quality. Trends in cognitive sciences, 17(1), 26-49. pdf
  • Vinyals, O. et al. Show and Tell: A Neural Image Caption Generator. 2014 arXiv.1411.4555v1 pdf
  • Koch, C., & Tononi, G. (2011). A test for consciousness. Scientific American, 304(6), 44-47. [pdf https://www.dropbox.com/s/h2bo3swrjr1g1l1/A_Test_for_Consciousness.pdf?dl=0]