EllSeg: An Ellipse Segmentation Framework for Robust Gaze Tracking
- URL: http://arxiv.org/abs/2007.09600v2
- Date: Wed, 4 May 2022 09:14:20 GMT
- Title: EllSeg: An Ellipse Segmentation Framework for Robust Gaze Tracking
- Authors: Rakshit S. Kothari, Aayush K. Chaudhary, Reynold J. Bailey, Jeff B.
Pelz, Gabriel J. Diaz
- Abstract summary: Ellipse fitting is an essential component in pupil or iris tracking based video oculography.
We propose training a convolutional neural network to directly segment entire elliptical structures.
- Score: 3.0448872422956432
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Ellipse fitting, an essential component in pupil or iris tracking based video
oculography, is performed on previously segmented eye parts generated using
various computer vision techniques. Several factors, such as occlusions due to
eyelid shape, camera position or eyelashes, frequently break ellipse fitting
algorithms that rely on well-defined pupil or iris edge segments. In this work,
we propose training a convolutional neural network to directly segment entire
elliptical structures and demonstrate that such a framework is robust to
occlusions and offers superior pupil and iris tracking performance (at least
10$\%$ and 24$\%$ increase in pupil and iris center detection rate respectively
within a two-pixel error margin) compared to using standard eye parts
segmentation for multiple publicly available synthetic segmentation datasets.
Related papers
- Towards Unsupervised Eye-Region Segmentation for Eye Tracking [9.051786094550293]
We use priors of human eye and extract signals from the image to establish rough clues indicating the eye-region structure.
A segmentation network is trained to gradually identify the precise area for each part.
Experiments show that our unsupervised approach can easily achieve 90% (the pupil and iris) and 85% (the whole eye-region) of the performances under supervised learning.
arXiv Detail & Related papers (2024-10-08T15:33:23Z) - CondSeg: Ellipse Estimation of Pupil and Iris via Conditioned Segmentation [9.680930476240674]
Parsing eye components (i.e. pupil, iris and sclera) is fundamental for eye tracking and gaze estimation for AR/VR products.
In this paper, we consider two priors: projected full pupil/iris circle can be modelled with ellipses (ellipse prior), and the visibility of pupil/iris is controlled by openness of eye-region.
We propose CondSeg to estimate elliptical parameters of pupil/iris directly from segmentation labels, without explicitly annotating full ellipses.
arXiv Detail & Related papers (2024-08-30T12:17:49Z) - Periocular biometrics: databases, algorithms and directions [69.35569554213679]
Periocular biometrics has been established as an independent modality due to concerns on the performance of iris or face systems in uncontrolled conditions.
This paper presents a review of the state of the art in periocular biometric research.
arXiv Detail & Related papers (2023-07-26T11:14:36Z) - A Simple Framework for Open-Vocabulary Segmentation and Detection [85.21641508535679]
We present OpenSeeD, a simple Open-vocabulary and Detection framework that jointly learns from different segmentation and detection datasets.
We first introduce a pre-trained text encoder to encode all the visual concepts in two tasks and learn a common semantic space for them.
After pre-training, our model exhibits competitive or stronger zero-shot transferability for both segmentation and detection.
arXiv Detail & Related papers (2023-03-14T17:58:34Z) - Self-Supervised Correction Learning for Semi-Supervised Biomedical Image
Segmentation [84.58210297703714]
We propose a self-supervised correction learning paradigm for semi-supervised biomedical image segmentation.
We design a dual-task network, including a shared encoder and two independent decoders for segmentation and lesion region inpainting.
Experiments on three medical image segmentation datasets for different tasks demonstrate the outstanding performance of our method.
arXiv Detail & Related papers (2023-01-12T08:19:46Z) - Segmentation-free Direct Iris Localization Networks [0.0]
This paper proposes an efficient iris localization method without using iris segmentation and circle fitting.
We propose an iris localization network (ILN) that can directly localize pupil and iris circles with eyelid points from a low-resolution iris image.
We also introduce a pupil refinement network (PRN) to improve the accuracy of pupil localization.
arXiv Detail & Related papers (2022-10-19T09:13:39Z) - Two-headed eye-segmentation approach for biometric identification [3.4998703934432682]
This paper introduces the new two-headed architecture, where the eye components and eyelashes are segmented using two separate decoding modules.
Thanks to the two-headed approach, we were also able to examine the quality of the model with the convex prior.
We conducted an extensive evaluation of various learning scenarios on real-life conditions high-resolution near-infrared iris images.
arXiv Detail & Related papers (2022-09-30T13:52:03Z) - Pseudo-label Guided Cross-video Pixel Contrast for Robotic Surgical
Scene Segmentation with Limited Annotations [72.15956198507281]
We propose PGV-CL, a novel pseudo-label guided cross-video contrast learning method to boost scene segmentation.
We extensively evaluate our method on a public robotic surgery dataset EndoVis18 and a public cataract dataset CaDIS.
arXiv Detail & Related papers (2022-07-20T05:42:19Z) - Open-world Semantic Segmentation via Contrasting and Clustering
Vision-Language Embedding [95.78002228538841]
We propose a new open-world semantic segmentation pipeline that makes the first attempt to learn to segment semantic objects of various open-world categories without any efforts on dense annotations.
Our method can directly segment objects of arbitrary categories, outperforming zero-shot segmentation methods that require data labeling on three benchmark datasets.
arXiv Detail & Related papers (2022-07-18T09:20:04Z) - A Deep Learning Approach for the Segmentation of Electroencephalography
Data in Eye Tracking Applications [56.458448869572294]
We introduce DETRtime, a novel framework for time-series segmentation of EEG data.
Our end-to-end deep learning-based framework brings advances in Computer Vision to the forefront.
Our model generalizes well in the task of EEG sleep stage segmentation.
arXiv Detail & Related papers (2022-06-17T10:17:24Z) - SIP-SegNet: A Deep Convolutional Encoder-Decoder Network for Joint
Semantic Segmentation and Extraction of Sclera, Iris and Pupil based on
Periocular Region Suppression [8.64118000141143]
multimodal biometric recognition systems have the ability to deal with the limitations of unimodal biometric systems.
Such systems possess high distinctiveness, permanence, and performance while, technologies based on other biometric traits can be easily compromised.
This work presents a novel deep learning framework called SIP-SegNet, which performs the joint semantic segmentation of ocular traits.
arXiv Detail & Related papers (2020-02-15T15:20:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.