哈尔滨工业大学计算机专业课 复试 Computer vision

本站小编 免费考研网/2020-02-22

Computer vision

From Wikipedia, the free encyclopedia

Computer vision is a field that includes methods for acquiring, processing, analyzing, and understanding images and, in general, high-dimensional data from the real world in order to produce numerical or symbolic information, e.g., in the forms of decisions.A theme in the development of this field has been to duplicate the abilities of human vision by electronically perceiving and understanding an image.[5] Understanding in this context means the transformation of visual images (the input of retina) into descriptions of world that can interface with other thought processes and elicit appropriate action. This image understanding can be seen as the disentangling of symbolic information from image data using models constructed with the aid of geometry, physics, statistics, and learning theory.[6] Computer vision has also been described as the enterprise of automating and integrating a wide range of processes and representations for vision perception.[7]

As a scientific discipline, computer vision is concerned with the theory behind artificial systems that extract information from images. The image data can take many forms, such as video sequences, views from multiple cameras, or multi-dimensional data from a medical scanner. As a technological discipline, computer vision seeks to apply its theories and models to the construction of computer vision systems.

Sub-domains of computer vision include scene reconstruction, event detection, video tracking, object recognition, object pose estimation, learning, indexing, motion estimation, and image restoration.

Representational and control requirements[edit]

Image-understanding systems (IUS) include three levels of abstraction as follows: Low level includes image primitives such as edges, texture elements, or regions; intermediate level includes boundaries, surfaces and volumes; and high level includes objects, scenes, or events. Many of these requirements are really topics for further research.

The representational requirements in the designing of IUS for these levels are: representation of prototypical concepts, concept organization, spatial knowledge, temporal knowledge, scaling, and description by comparison and differentiation.

While inference refers to the process of deriving new, not explicitly represented facts from currently known facts, control refers to the process that selects which of the many inference, search, and matching techniques should be applied at a particular stage of processing. Inference and control requirements for IUS are: search and hypothesis activation, matching and hypothesis testing, generation and use of expectations, change and focus of attention, certainty and strength of belief, inference and goal satisfaction.[10]

Typical tasks of computer vision[edit]

Each of the application areas described above employ a range of computer vision tasks; more or less well-defined measurement problems or processing problems, which can be solved using a variety of methods. Some examples of typical computer vision tasks are presented below.

Recognition[edit]

The classical problem in computer vision, image processing, and machine vision is that of determining whether or not the image data contains some specific object, feature, or activity. Different varieties of the recognition problem are described in the literature:

l    Object recognition (also called object classification) – one or several pre-specified or learned objects or object classes can be recognized, usually together with their 2D positions in the image or 3D poses in the scene. Blippar, Google Goggles and Like That provide stand-alone programs that illustrate this functionality.

l    Identification – an individual instance of an object is recognized. Examples include identification of a specific person's face or fingerprint, identification of handwritten digits, or identification of a specific vehicle.

l    Detection – the image data are scanned for a specific condition. Examples include detection of possible abnormal cells or tissues in medical images or detection of a vehicle in an automatic road toll system. Detection based on relatively simple and fast computations is sometimes used for finding smaller regions of interesting image data which can be further analyzed by more computationally demanding techniques to produce a correct interpretation.

Currently, the best algorithms for such tasks are based on convolutional neural networks. An illustration of their capabilities is given by the Image Net Large Scale Visual Recognition Challenge; this is a benchmark in object classification and detection, with millions of images and hundreds of object classes. Performance of convolutional neural networks, on the ImageNet tests, is now close to that of humans.[11] The best algorithms still struggle with objects that are small or thin, such as a small ant on a stem of a flower or a person holding a quill in their hand. They also have trouble with images that have been distorted with filters (an increasingly common phenomenon with modern digital cameras). By contrast, those kinds of images rarely trouble humans. Humans, however, tend to have trouble with other issues. For example, they are not good at classifying objects into fine-grained classes, such as the particular breed of dog or species of bird, whereas convolutional neural networks handle this with ease.

Several specialized tasks based on recognition exist, such as:

l    Content-based image retrieval – finding all images in a larger set of images which have a specific content. The content can be specified in different ways, for example in terms of similarity relative a target image (give me all images similar to image X), or in terms of high-level search criteria given as text input (give me all images which contains many houses, are taken during winter, and have no cars in them).

Computer vision for people counter purposes in public places, malls, shopping centres

l    Pose estimation – estimating the position or orientation of a specific object relative to the camera. An example application for this technique would be assisting a robot arm in retrieving objects from a conveyor belt in anassembly line situation or picking parts from a bin.

l    Optical character recognition (OCR) – identifying characters in images of printed or handwritten text, usually with a view to encoding the text in a format more amenable to editing or indexing (e.g. ASCII).

l    2D Code reading Reading of 2D codes such as data matrix and QR codes.

l    Facial recognition

l    Shape Recognition Technology (SRT) in people counter systems differentiating human beings (head and shoulder patterns) from objects

Computer vision system methods[edit]

The organization of a computer vision system is highly application dependent. Some systems are stand-alone applications which solve a specific measurement or detection problem, while others constitute a sub-system of a larger design which, for example, also contains sub-systems for control of mechanical actuators, planning, information databases, man-machine interfaces, etc. The specific implementation of a computer vision system also depends on if its functionality is pre-specified or if some part of it can be learned or modified during operation. Many functions are unique to the application. There are, however, typical functions which are found in many computer vision systems.

l    Image acquisition – A digital image is produced by one or several image sensors, which, besides various types of light-sensitive cameras, includerange sensors, tomography devices, radar, ultra-sonic cameras, etc. Depending on the type of sensor, the resulting image data is an ordinary 2D image, a 3D volume, or an image sequence. The pixel values typically correspond to light intensity in one or several spectral bands (gray images or colour images), but can also be related to various physical measures, such as depth, absorption or reflectance of sonic or electromagnetic waves, or nuclear magnetic resonance.[12]

l    Pre-processing – Before a computer vision method can be applied to image data in order to extract some specific piece of information, it is usually necessary to process the data in order to assure that it satisfies certain assumptions implied by the method. Examples are

l    Re-sampling in order to assure that the image coordinate system is correct.

l    Noise reduction in order to assure that sensor noise does not introduce false information.

l    Contrast enhancement to assure that relevant information can be detected.

l    Scale space representation to enhance image structures at locally appropriate scales.

l    Feature extraction – Image features at various levels of complexity are extracted from the image data.[12] Typical examples of such features are

l    Lines, edges and ridges.

l    Localized interest points such as corners, blobs or points.

More complex features may be related to texture, shape or motion.

l    Detection/segmentation – At some point in the processing a decision is made about which image points or regions of the image are relevant for further processing.[12] Examples are

l    Selection of a specific set of interest points

l    Segmentation of one or multiple image regions which contain a specific object of interest.

l    Segmentation of image into nested scene architecture comprised foreground, object groups, single objects or salient object parts (also referred to as spatial-taxon scene hierarchy)[13]

l    High-level processing – At this step the input is typically a small set of data, for example a set of points or an image region which is assumed to contain a specific object.[12] The remaining processing deals with, for example:

l    Verification that the data satisfy model-based and application specific assumptions.

l    Estimation of application specific parameters, such as object pose or object size.

l    Image recognition – classifying a detected object into different categories.

l    Image registration – comparing and combining two different views of the same object.

l    Decision making Making the final decision required for the application,[12] for example:

l    Pass/fail on automatic inspection applications

l    Match / no-match in recognition applications

l    Flag for further human review in medical, military, security and recognition applications


相关话题/哈尔滨工业大学 计算机

  • 领限时大额优惠券,享本站正版考研考试资料!
    大额优惠券
    优惠券领取后72小时内有效,10万种最新考研考试考证类电子打印资料任你选。涵盖全国500余所院校考研专业课、200多种职业资格考试、1100多种经典教材,产品类型包含电子书、题库、全套资料以及视频,无论您是考研复习、考证刷题,还是考前冲刺等,不同类型的产品可满足您学习上的不同需求。 ...
    本站小编 Free壹佰分学习网 2022-09-19
  • 哈尔滨工业大学计算机专业课 复试 2016HITCS机试试题及答案含部分解析
    /* * Theme:2016HIT复试机试试题、答案及部分解析 * Author:HITCS * Modify:Biechuyangwang(心理咨询木木) * Remark:如有疑问2018HIT群里提问或私聊,改错题不想写,有兴趣可以私聊 */ 试卷:2016-HITCS 单项选择(120分) 1.(2分)在OSI参考模型中,实现路由功能的是 A. 会话层 B. 传输层 ...
    本站小编 免费考研网 2020-02-22
  • 哈尔滨工业大学计算机专业课 复试 2015HITCS
    /* * Theme:2015HIT复试机试试题、答案及部分解析 * Author:HITCS * Modify:Biechuyangwang(心理咨询木木) * Remark:如有疑问2018HIT群里提问或私聊,改错题不想写,有兴趣可以私聊 */ 试卷:2015-HITCS 单项选择(130分) 1. 设A,B为集合,使下列两式A\B=和(AB)\B=(A\B) ...
    本站小编 免费考研网 2020-02-22
  • 哈尔滨工业大学计算机专业课 复试 2016机试附加题
    原本没觉得会有多少人会看附加题,毕竟大佬不需要,跨考要了也没用。 但是事实是很多私聊我要附加题,由于确实没去AC,就索性没有回各位。 现在有时间了,把附加题AC了一下,放在blog上,有人反映404,应该是审核还没通过 所以写了这个文档作为统一回复。 最后一点,求求各位女装大佬或女大佬别加我好友,问问题 ...
    本站小编 免费考研网 2020-02-22
  • 哈尔滨工业大学应用经济学原理849经验分享
    19考研我的总分412分,本部国贸初试第一,已录取。我的各科成绩比较平均,没有特别突出的,也没有拉后腿的,70+,70+,130,130+。每一科我都认真复习,因为我觉得无论哪一科都是可以拉开差距的,一科差5分,四科就能差20分。事实证明我的想法是正确的,有的同学在复习时把大部分经历放在数学上,结果英语没到60单科没过线, ...
    本站小编 免费考研网 2020-02-20
  • 哈尔滨工业大学2014年808理论力学考试试题
    本站小编 免费考研网 2020-02-19
  • 重庆大学零基础生物跨考计算机初试复试经验
    之前在王道论坛上发表过这篇帖子,但是担心用考研帮的同学没有看过我的经验贴,故特意将此贴重新发表在考研帮,希望能为考研的同学做出一点微薄的贡献,也是一个已经经过考研的学子对学弟学妹的鼓舞,望大家不断坚持,不断奋斗,做更好的自己! 2018零基础生物跨考计算机初试、复试经验: 考研到现在为止 ...
    本站小编 免费考研网 2020-02-18
  • 北京邮电大学21计算机考研经验分享!
    作者: 北邮学长66 时间: 2020-2-12 13:55 北邮有四个学院属于计算机类,分为两大类,一类是考专业课803,另一类是考专业课807。803包括四门课,807包括两门课(803/807是我们专业课的代码)。 考803的有计算机学院、网络技术研究院、网络空间安全学院,这三个学院的学硕和专硕初试考的一模一样,复试都是笔试、机试和面试 ...
    本站小编 免费考研网 2020-02-12
  • 东北大学计算机2017-2019考研真题
    东北大学2017年真题 C语言程序设计部分(75分) 一、简答题,每题5分。 1.将一个指针指向一个数组,举例写出如何表示数组中的元素(至少3种) 2.试说明什么是指针,哪些是指针,哪些不是,怎么样使用指针,请举例说明。 3.为什么要声明函数?哪些函数不需要声明,哪些函数需要说明,请举例说明。 二、给出程序运行结 ...
    本站小编 免费考研网 2020-02-08
  • 电子科技大学计算机考研复试经验贴
    成都电子科大计算机人工智能方向研究生在读,真人现身说法,看看复试时怎么让老师被你牵着走!先例子后总结,如果有用欢迎点评论区留言哦! 复试时候,由于我是跨考,所以面试前贼紧张我们当时的面试是先问一个专业课问题,然后你自我介绍之后老师进行自由提问,时间是15min就左右。 因为我报考得是成 ...
    本站小编 免费考研网 2020-02-06
  • 2018云南大学计算机程序设计904回忆考研试题
    本站小编 免费考研网 2020-01-31
  • 电子科技大学计算机820考研经验贴
    本人2019年考研,初试分数386,初试加复试总分815、总排名第二。目前电科计算机科学与技术专业机器智能与模式识别方向研一在读。 本科是电科机电学院(上课时学过C语言),可以说基本是零基础跨考。从18年3月份决定跨考到最后成功上岸,一路以来走过不少弯路,也逐步在曲折中摸索出一些经验。 一 整体时间规划 对于 ...
    本站小编 免费考研网 2020-01-29
  • 2020哈尔滨工业大学计算机考研试题854计算机基础
    2020 哈尔滨工业大学 854 真题回忆版 第 I 部分 计算机系统与基础 一、选择题(10 道题,每道 1 分) 1.objdump 反编译的程序中,有个常数1024(RIP),请问是什么。 A 局部变量 B 局部静态变变量 C 全局变量 D 都不是 2 linux 内核的编码方式采用编码 A unicode B ASCII C utf-8 D utf-32 3 ...
    本站小编 免费考研网 2020-01-15
  • 苏州大学计算机考研初试经验分享
    作者: jvjv 时间: 2020-1-11 10:12 标题: 苏大计算机考研初试经验分享,干货 大三上学期末决定考研,然后就开始选学校了,根据我的经验选学校一定要早点看,因为到后期没有时间了,都在忙着考研复习。机缘巧合下,我的一个亲戚推荐我看一下苏州大学,哈哈,苏大的地理位置很好,还是211,计算机专业也很强,所以我就开 ...
    本站小编 考研帮 2020-01-14
  • 2020哈尔滨工业大学通信803复试经验分享
    【复试笔试167分学姐分享哈工大通信803复试经验从初试经验,考试时感受,现在写到复试经验了,这一路走来胜利就在眼前了,筒子们冲啊!】 复试概述 复试分笔试200和面试150。面试一共分四轮,第一轮自我介绍,第二轮英语面,第三轮专业知识面,第四轮数学建模题,分数比例大概是30:40:40:40。面试分 ...
    本站小编 以梦为马xue 2020-01-13
  • 2020大连海事大学计算机真题回忆
    判断题多了一道prim,剩下的题型基本没变,变化的有递归调用函数那块,个人觉得今年程序写的递归输出栈的结果,没有要返回函数。可能我想的不对,自己注意理解吧。 复习方向avl成功失败,树的节点问题,哈希表,最后一道大题,把lb的所有节点逆插到la的第k个节点,程序要会用头插法尾插法。 ...
    本站小编 免费考研网 2020-01-12