Gender and age classification using ASMNet based facial fiducial detection and Jordan neural network

Meenakshi, J.; Thailambal, G.

doi:10.1007/s13748-024-00336-x

Gender and age classification using ASMNet based facial fiducial detection and Jordan neural network
基于 ASMNet 的 facial fiducial 检测和 Jordan 神经网络进行性别和年龄分类

Regular Paper 常规论文
Published: 29 August 2024
发布日期：2024 年 8 月 29 日

Volume 13, pages 293–306, (2024)
第 13 卷，第 293-306 页，（2024 年）
Cite this article
引用此文章

Download PDF 下载 PDF

Access provided by International Science and Technology Information Center 国际科学技术信息中心提供访问权限

Progress in Artificial Intelligence
人工智能进展 Aims and scope 目标和范围 Submit manuscript 提交稿件

Gender and age classification using ASMNet based facial fiducial detection and Jordan neural network
基于 ASMNet 的 facial fiducial 检测和 Jordan 神经网络进行性别和年龄分类

Download PDF 下载 PDF

J. Meenakshi¹ &
G. Thailambal¹

86 Accesses
Explore all metrics

Abstract 摘要

A capacity to instantly determine someone's age and gender merely by looking at their face has made facial recognition technology essential. Among the difficulties faced by researchers in computer vision and also psychophysics are the observation of human faces and modeling of their characteristic traits. Due to inadequate face fiducial point detection and poor image quality, many of the methods that have been designed in existing models based on facial features for age and gender categorization still face some challenges. Hence, the Active Shape Model (ASMNET) based Jordan neural network was developed for facial fiducial detection. In this designed model, the facial images are considered as input. These images are pre-processed using cropping, center surrounds device normalization, optimized Gabor filter and logarithmic transformation. Based on the preprocessed data, facial fiducial points and distinct features are detected using ASMNet combined with Convolutional Neural Network. Using this primary facial detected landmarks such as eye, mouth, nose tip and lips are extracted for features using EfficientNetB7 and classified based on the Jordan neural network to categorize age and gender. Performance metrics for this designed model include Accuracy, Positive predictive value, Hit rate, Selectivity and Negative Predictive Value. The proposed models achieved performance metrics values are 93%, 87%, 89%, 94.82% and 92.32%. Gender and Age Classification using ASMNet based Facial Fiducial Detection and Jordan Neural Network is better than the existing model along with that using this prediction technique the possibility of error rate gets reduced and timely detection can be achieved.
仅通过观察人脸就能瞬间判断出一个人的年龄和性别，这使得人脸识别技术变得至关重要。在计算机视觉和心理学领域的研究者面临的困难中，包括观察人脸和建模其特征。由于人脸关键点检测不足和图像质量差，许多基于面部特征进行年龄和性别分类的方法在现有模型中仍然面临一些挑战。因此，开发了基于 Active Shape Model (ASMNET)的 Jordan 神经网络进行人脸关键点检测。在这个设计的模型中，面部图像被视为输入。这些图像使用裁剪、中心环绕设备归一化、优化 Gabor 滤波器和对数变换进行预处理。基于预处理数据，使用 ASMNet 结合卷积神经网络检测面部关键点和独特特征。使用此主要面部检测到的特征点，如眼睛、嘴巴、鼻尖和嘴唇，通过 EfficientNetB7 提取特征，并根据 Jordan 神经网络进行分类以区分年龄和性别。该设计模型的性能指标包括准确率、阳性预测值、命中率、选择性和阴性预测值。所提出的模型在性能指标上的值分别为 93%、87%、89%、94.82%和 92.32%。基于 ASMNet 的面部特征点检测和 Jordan 神经网络进行性别和年龄分类优于现有模型，同时使用这种预测技术可以降低错误率并实现及时检测。

Face Detection and Facial Feature Extraction with Machine Learning
人脸检测与面部特征提取基于机器学习

A Comparative Analysis of Recent Face Detection Methods Implemented for Age and Gender Detection
《近期用于年龄和性别检测的人脸检测方法的比较分析》

Neural networks for facial age estimation: a survey on recent advances
神经网络在人脸年龄估计中的应用：近期进展综述

Article 26 September 2019
文章 2019 年 9 月 26 日

Discover the latest articles, news and stories from top researchers in related subjects.
发现相关领域顶尖研究人员发布的最新文章、新闻和故事。

Artificial Intelligence
人工智能

Use our pre-submission checklist
使用我们的投稿清单

Avoid common mistakes on your manuscript.
避免在您的稿件中犯常见错误。

1 Introduction 1 引言

One of the most significant tasks in computer vision is human face analysis because it is a highly deformable object that requires automatic analysis. Characterizing factors such as age, gender, facial features, expressions, clothing as well as personality are important in a variety of applications, including face tracking, behavior recognition, user identification and social interaction [1]. Gender and age are seen to be crucial biometric traits for identifying individuals. Bio-metric recognition is the process of gathering information about a person unique physiological and behavioral characteristics for purposes of human identification in addition to verification (security models) [2]. There are two types of biometrics: hard biometrics (physical, behavioral, and biological) and soft biometrics (age, gender, ethnicity, height as well as face measures) [3]. Soft-biometric characteristics, such as skin tone, hair type, distance between nose and eye, facial form in addition to so on, can be used to categorize unlabeled subjects for different genders also age groups and to speed up data traversal.
计算机视觉中最具意义的任务之一是人脸分析，因为它是一个高度可变形的物体，需要自动分析。在包括人脸追踪、行为识别、用户识别和社会互动在内的各种应用中，年龄、性别、面部特征、表情、服装以及个性等特征描述非常重要 [1]。性别和年龄被视为识别个体的关键生物特征。生物识别识别是收集有关个人独特生理和行为特征的信息的过程，目的是进行人类识别，以及验证（安全模型）[2]。生物识别有两种类型：硬生物识别（物理、行为和生物）和软生物识别（年龄、性别、种族、身高以及面部测量）[3]。如肤色、发质、鼻眼距离、面部形状等软生物特征，可以用于对未标记的个体进行不同性别和年龄组的分类，并加快数据传输。

Face, age and gender detection algorithms have a significant influence on the gender and age rage that is reflected in the user photo validation process [4]. An essential first step in determining age and gender is facial recognition [5]. Numerous techniques, including Haar Cascade, Convolutional Neural Networks (CNN) [6], Histogram of Oriented Gradients (HoG) also Deep Neural Networks (DNN) can be used to accomplish this task [7]. Each algorithm has pros and cons of its own. Neural Convolution Architecture CNN is a prominent deep learning system that can automatically learn to do classification tasks based just on images [8]. Because it treats the prediction of age and gender range as a two-class classification problem, this algorithm will be quite helpful in our situation [9]. For gender, there are males and females, and for age ranges, there are various classes.
面部、年龄和性别检测算法对用户照片验证过程中反映的性别和年龄范围有重大影响[4]。确定年龄和性别的一个基本步骤是面部识别[5]。包括 Haar Cascade、卷积神经网络（CNN）[6]、方向梯度直方图（HoG）以及深度神经网络（DNN）在内的多种技术都可以用于完成这项任务[7]。每种算法都有其自身的优缺点。神经网络卷积架构 CNN 是一种突出的深度学习系统，它可以根据图像自动学习进行分类任务[8]。因为它将年龄和性别的预测视为一个二分类问题，所以这个算法在我们的情况下将非常有帮助[9]。对于性别，有男性和女性，对于年龄范围，有各种类别。

A primary drawback is that CNNs are highly effective in displaying images, it is challenging to adequately train a classic big-scale CNN with small datasets [10]. However, due to many reasons, obtaining a large scale of photos with age and gender identifiers is challenging [7]. The training data amount for the gender and age prediction problem are typically small, and the face images of the same individuals only span a small range of ages [8]. The models frequently experience overfitting since it is difficult to take advantage of the most universal characteristics of the age with small data [9]. The second drawback is that the current techniques take the entire image as an input for the network, and the intricate background information significantly impedes the process of extracting features and further impairs prediction ability [10]. As a result, if the feature extractor uses the original image as input, the age prediction will be affected. The third drawback is that most CNN-based techniques use the output from the network's final fully connected layer to represent the image [11]. To address the aforementioned limitations, Gender and Age Classification using ASMNet based Facial Fiducial Detection and Jordan Neural Network. Major contributions of the designed model are
一个主要缺点是 CNN 在显示图像方面非常有效，但使用小数据集训练经典的大规模 CNN 具有挑战性[10]。然而，由于许多原因，获取带有年龄和性别标识的大规模照片具有挑战性[7]。性别和年龄预测问题的训练数据量通常很小，同一个人的面部图像只覆盖了很小的年龄范围[8]。由于难以利用小数据中最普遍的年龄特征，模型经常出现过拟合[9]。第二个缺点是，当前技术将整个图像作为网络的输入，复杂的背景信息极大地阻碍了特征提取过程，并进一步损害了预测能力[10]。因此，如果特征提取器使用原始图像作为输入，年龄预测将受到影响。第三个缺点是，大多数基于 CNN 的技术使用网络最终全连接层的输出来表示图像[11]。为了解决上述局限性，使用基于 ASMNet 的人脸关键点检测和 Jordan 神经网络进行性别和年龄分类。该模型的主要贡献是

Gender and Age Classification using ASMNet based Facial Fiducial Detection and Jordan Neural Network.
Initially, the images are pre-processed using cropping, centre surround device normalization, optimized Gabor filter and logarithmic transformation to improve the image quality.
Center surround device normalization is used for normalizing the pixels in the image based on the centre point for linearizing the pixels of the facial images.
Optimized Gabor filter is used for removing the noise from the facial images based on the optimization of the orientation value using lyre bird optimization and Log transformations is used to enhance the contrast of dark images.
Facial Fiducial and pose from the facial images are detected using Active Shape Model combined with CNN (ASMNet) for learning the distinctive features of an image.
EfficientNetB7 and Jordan Neural Networks are used to extract features and categorize age and gender of the people.

The remaining portions of the paper are organized in the following manner several articles on facial fiducial detection are reviewed in Sect. 2; the method that has been proposed for an efficient security testing process is briefly explained in Sect. 3 In Sect. 4, the experimental results are presented for the developed Facial Fiducial Detection model; The entire study article is concluded in Sect. 5.

2 Literature review

Majority of the publications on Gender and Age Classification using ASMNet based Facial Fiducial Detection and Jordan Neural Network are studied in this field, and below is an evaluation of some articles along with their drawbacks.

Duan et al. [12] introduced a hybrid architecture that combines the strengths of two classifiers Convolutional Neural Network (CNN) and Extreme Learning Machine (ELM) to handle the classification of age and gender. By using CNN to extract features from the input images and ELM to classify the intermediate outputs, the hybrid architecture maximizes their respective benefits. This implements the design effectively and takes several precautions to reduce the probability of overfitting. This involves developing several layers and variables by examining the hybrid architecture and determining the back-propagation functions in this framework through iterations. Next, the hybrid structure is verified using two widely used datasets, namely MORPHII and Adience Benchmark.

Hassan et al. [13] involved several CNN method for classifying people based on their age and gender. The five stages of the described method are as follows: face alignment, multiple CNN, face detection, background removal, and voting systems. With three distinct CNNs in terms of depth and structure, multiple CNN model aims to extract different features for every network. After training each network independently on the AGFW dataset, predictions are combined using the voting mechanism to determine the outcome.

Khan et al. [14] designed a unified system using end-to-end semantic face segmentation for face picture analysis. A collection of stack components for face comprehension, including as head posture estimation, age categorization as well as gender recognition, are included in the suggested framework. The segmentation model based on Conditional Random Fields (CRFs) is trained using a manually labelled face data-set. A face image is divided into six segments using a multi-class face segmentation framework created using CRFs. For every class, probability maps are created using the probabilistic classification technique. To each task (head posture, age, and gender), a RDF classifier is modelled based on the probability maps as feature descriptors.

Nada et al. [15] developed an approach to confirm that the user's age range as well as gender are accurately reflected in his photo. Additionally, a double-check layer validator based on the Deep Learning approach is added by creating a link between the user photo, gender as well as date of birth form inputs. This is done by utilizing a Convolutional Neural Network (CNN or ConvNets) to recognize the gender also estimate the age from a single person's photo. Furthermore, a web API is built to facilitate the validation process. Using the images of University of Palestine students, it finally assessed this solution and found that, while it has some issues with age prediction, it has done a fantastic job with gender prediction.

Haseena et al. [16] developed to provide people nourishing food according to their age and gender as inferred from their facial features. In order to extract features using the deep convolution neural network (DCNN) approach, the presented methodology first pre-processes the input image. After the neural network has extracted the dimensions using the original facial image, the attribute selection approach is carried out based on the hybrid particle swarm optimisation (HPSO) to choose unique and recognisable facial components. A person's age and gender can be determined using support vector machines (SVM).

Majority of the articles evaluated above are related to the Identification of Facial Fiducial model. The ELM model may require a hidden layer with a high level of complexity due to the random initialization of parameters (weights and biases) [12]. CNNs are renowned for being challenging to optimize and for requiring substantial amounts of training data and processing capacity to train [13] 15. Random forests are biased in favour of qualities with higher levels when they include categorical variables with varying numbers of levels in the data [14]. Extended training duration for big datasets. The final model's varying weights and individual influence make it difficult to comprehend and evaluate [16].

3 Proposed methodology

With the emergence of social platforms and social media, automatic age and gender classification has gained significance for a wider range of applications. However, the real-world image performance of current approaches is still not entirely good enough, particularly in light of the significant leaps in efficiency that have recently been observed with regard to the associated task of facial recognition. The process flow of the proposed model is illustrated in Fig. 1.

In this proposed model, the facial images are considered as input data for gender and age prediction. These data sets are pre-processed using cropping, center surrounds device normalization, optimized Gabor filter and logarithmic transformation. Using the cropping technique, the face portion is cropped from the background. Then, center surround device normalization is used for face normalization in a pixel wise manner. Optimized Gabor filter is used for noise removal in which the orientation value is optimally selected using lyre bird optimization. Logarithmic transformation is used for enhancing the contrast of the image. Based on the preprocessed data, facial Fiducial and pose are detected using ASMNet (Active Shape Model combined with CNN). Using this model primary facial landmarks such as eye, mouth, nose tip and lips, the feature extraction is achieved using EfficientNetB7 and classification using Jordan neural network to categorize age and gender.

3.1 Pre-processing

A preliminary stage in the processing of raw data to prepare it for the main phase or further analysis. In this designed model, the pre-processing approaches are cropping, centre surround device normalization, optimized Gabor filter and logarithmic transformation.

3.1.1 Cropping

A process of cropping an illustrated image involves removing undesired exterior regions. In order to enhance framing or composition, direct the viewer's attention to the topic, or alter the size or aspect ratio, an image is said to be "cropped" when its outer edges are removed or modified. To put it another way, photo cropping is the process of enhancing a picture by deleting elements that are not needed.

3.1.2 Center surround device normalization

Land Retinex theory was applied as SSR by [17], using the most recent version. To process the image, a class of centre surround functions is applied, each of which takes an input value (called the centre) and its surrounding neighbourhood (called the surround) to produce its output value. Gaussian function surrounds the defined centre, which is every pixel value. This provides the SSR's mathematical form:

(1)

where SSR is denoted as the Retinex output, input image is represented by and represents the convolution product of and . The latter function has a Gaussian kernel with a basic linear filter:

(2)

where Pixel-by-pixel spatial detail retention is controlled by the empirically established filter standard deviation,, and the normalizing factor maintains the value of 1 for the area under the Gaussian curve.

3.1.3 Optimized gabor filter

Optimized Gabor filter is used for noise removal in which the orientation value is optimally selected using lyre bird optimization. The Gabor filter have edge localization ability and high edge resolution with high accuracy, high degree of extraction and complete and precise boundary.

(i)
Gabor filter

A complex plane wave and a Gaussian-shaped function make up the composite function known as a two-dimensional Gabor filter [18]. It is stated as follows:

(3)

A Gabor filter's orientation is given by , the standard deviation is represented by and , respectively, and the filter centre frequency is indicated by .

The Euler formula states that the Gabor filter can be decomposed into two parts: an imaginary part and a real part. Because finger veins appear like dark ridges in images, this is an excellent application of the real part of the Gabor filter to exploit vein information from an image. The Gabor filter has been rewritten as

(4)

where is represented as the orientation and channel index, respectively and The Gabor filter's centre frequency in the channel is denoted as .

Assuming that a finger-vein image is denoted by , filtered in the channel is represented by the notation . This can obtain

(5)

where convolution operation in two dimensions is represented by . As a result, the Gabor filter produces eight filtered images for a finger vein image.

ii) Lyrebird optimization

The splendid lyrebird and Albert's lyrebird are the two species of lyrebirds that are native to Australia [19]. This wonderful bird belongs to the family Menuridae. Their big tails, which the males fling out in an attraction display, are a sight to observe. They are also highly skilled at mimicking both artificial and natural noises from their surroundings. One of the most recognizable native birds of Australia is the lyrebird, which has a distinctive plume of neutral-colored tail feathers. Males and females of the Superb lyrebird species length 80–98 cm and 74–84 cm, respectively. When it comes to size, the female of Albert's lyrebird may grow to a maximum of 84 cm, while the male can grow a maximum of 90 cm. Similar in all respects to the superb lyrebird, Albert's lyrebird has smaller, less stunning lyrate feathers. They weigh approximately 0.93 kg, whereas superb lyrebirds weigh approximately 0.97 kg.

Step 1: Initialization.

Optimized Gabor filter is used for noise removal in which the orientation value (θ) is optimally selected using lyre bird optimization. The orientation value (θ) is considered as attributes X1, X2,…, X4.

(6)

Step 2: Fitness Function.

To evaluate the fitness function, the aforementioned equation is used to maximize accuracy based on the k-fold validation.

(7)

(8)

Dataset D is randomly divided into k almost equal-sized, mutually exclusive subsets (the folds),, in k-fold cross-validation, also known as rotation estimation. For each , inducer is tested k times on and . Cross-validation estimate of accuracy is computed using the total count of instances in the dataset. is, logically, the test set containing the instance and ensuing cross-validation accuracy determine.

Step 3: Updating.

Following the Fitness function process, Eq. (9) is used to update all subsequent sets of attributes. Using the lyrebird optimization process, the updating equation is determined.

(9)

Step 4: Termination.

After attaining the hyperparameters of the Gabor filter for optimally learning the information from the facial images, the entire process will be terminated.

3.1.4 Logarithmic transformation

Log transformations are one of the fundamental spatial image enhancement techniques that can be used to enhance the contrast of dark images. The gray levels of the image pixels are changed by the log transform, which is actually a gray level transform [20]. This transformation converts a limited range of low level gray values in the input image to a larger range of output levels. At greater input gray levels, the converse is true. As a result, the darker input values are dispersed into the higher gray level values, enhancing the image's overall brightness and contrast. Mathematically, the log transformation's general form is expressed as

(10)

where, is denoted as the output grey level, c is a constant and is represented as the input grey level. Assumed to be In image processing, the general goal of logarithmic transformation is to improve the visual appearance of images by varying intensity values to increase contrast and bring out details that have been hidden in the original image. For classification, dataset should be preprocessed to obtain the accurate outcome, so the logarithmic transformation is used. However this transformation is complex and time consuming, but the model is more effective for the contrast enhancement of input image and accurate classification of age and gender using JNN model.

3.1.5 ASM network

A statistical representation of shape objects is the active shape model. , which are aligned into a common coordinate system, depict each shape as points. The covariance matrix derived from a set of K training shape samples is evaluated by principle component analysis (PCA) in order to simplify the issue and identify shape components. If the model is constructed, Eq. 11 is used to approximate any training sample (S):

(11)

where the sample mean is represented by , the covariance matrix has eigenvectors, and represented as a dimensional vector provided by Eq. 12:

(12)

Consequently, vector defines a set of parameters for a deformable model, allowing the model's shape to be altered by adjusting the vector's constituent elements.

Take into consideration the ith parameter statistical variation (eigenvalue) of to be . Generally, vector’s parameter is restricted to in order to ensure that the image created when using ASM is reasonably comparable to the ground truth [7]. The developed shape can resemble the ones in the original training set owing to this restriction. Therefore, using this restriction, we generate a new shape in accordance with Eq. 13:

(13)

where represents the restricted . It defines the ASM operator as well, based on Eq. 14:

(14)

Using Eqs. 11, 12, and 13, ASM converts every given input point into a new point . Based on this algorithm, the facial fiducial and pose points are detected.

3.2 Feature extraction

The process of dimensionality reduction includes feature extraction, which is breaking up an initial set of preprocessed data into smaller, easier-to-manage groups.

3.2.1 EfficientNetB7

Given that Efficient Net [21] model is one of the most sophisticated models also achieves an accuracy score of 84.4% with 66 M parameters in ImageNet dataset classification test, it can be viewed as a collection of CNN models. Eight models that comprise EfficientNet model range in value from B0 to B7; while accuracy climbs sharply with increasing model count, the number of predicted parameters does not. Instead of using the Rectifier Linear Unit (ReLu) activation function, EfficientNet uses a novel one termed the Leaky ReLu activation function [22]. In contrast to other cutting-edge models, EfficientNet generates more efficient outcomes by evenly scaling width, resolution, and depth as the model is reduced in size. The first step in using the compound scaling technique with a fixed resource limitation is to search for a grid that shows the relationships between the baseline network's various scaling dimensions. EfficientNet utilized the main building block introduced by MobileNet V2, the MBConv bottleneck, but it was utilized much more than MobileNet V2 due to the larger "Floating point operations per second" (FLOPS) budget. Because blocks in MBConv are composed of a layer that expands and then compresses the channels, direct connections are employed between bottlenecks with significantly fewer channels than expansion layers. Computation is lowered by the K2 factor, where k is the kernel size and denotes the 2D convolution window's width and height, as the layers' designs split apart.

EfficientNet is described mathematically in (Eq. (15)) as:

(15)

In this case, times are repeated in the variance of , and stands for the layer mean. Represents the shape input with respect to layer in the tensor of . Images' inputs are converted from the layers must scale with a proportionate ratio optimized using the provided formula in order to increase the model's accuracy.

(16)

(17)

FLOPS (P) < = destinated_flops.

Memory (P) < = destinated_memory.

Equation (16) uses the values x, y, and z to indicate the height, width, and resolution. Equation (17) displays the number of layers employed in the model together with parameter details. Figure 2 shows a systematic diagram of EfficientNet B7.

3.3 Classification

Jordan developed a novel circular neural network by fusing the distributed parallel processing theory with the Hopfield network storage notion [23]. Input layer, output layer, hidden layer, also context layer are the four components that make up a Jordan neural network. The connection between the output layer and the context layer has a first-order delay operator, allowing the context layer to hold the output layer's data. Within the neural network, there are two different kinds of activation functions: nonlinear and linear. In this work, output layer uses a linear function, whereas hidden layer uses a sigmoid nonlinear activation function, denoted by formula . Figure 3 displays the topology of the Jordan neural network.

For Jordan neural network, ^T stands for the input vector, where and hidden layer's output vector is represented by vector . These are weights from the input layer neuron to to first hidden layer neuron . These are weights from the context layer neuron to ith first hidden layer neuron . Hidden neuron (i) weights to output layer neuron are . Deep layer neuron (i) biases are represented by . Biases of the output layer and context layer are denoted by , respectively. Activation functions of the output layer also context layer are typically linear functions, but the activation function of hidden layer neuron (i) is represented as . Hidden layer neuron's (i) value is given by . Output layer and context layer values are represented by .

To facilitate clear communication and easy writing, the following indicators will be shown.

(18)

(19)

(20)

(21)

Thus, the Jordan neural network can be acquired;

(22)

(23)

(24)

Figure 4, shows a thorough schematic of the Jordan neural network's first three phases to help you better comprehend Eq. (24).

Takens theorem of embedding yields the following equation:

(25)

In this case, Jordan neural network's fitting value at time is represented as . Furthermore, smooth mapping can be written as in

(26)

Thus, by using this Jordan neural network, the age and gender prediction based on the facial images are trained and tested.

4 Result and discussion

Gender and Age Classification using ASMNet based Facial Fiducial Detection and Jordan Neural Network. Python 3.8.8 is used to evaluate a desired model, together with a 2.50 GHz Intel(R), Core(TM) i5-10300H processor, 32.0 GB of RAM (31.8 GB of which is useable), and the following specifications: 32 GB of memory. The collected datasets are pre-processed using cropping, center surrounds device normalization, optimized Gabor filter and logarithmic transformation. Based on the preprocessed data, facial Fiducial and pose are detected using ASMNet (Active Shape Model combined with CNN). Then, feature extraction is achieved using EfficientNetB7 and classification using the Jordan neural network to categorize age and gender.

iDataset Description

Face image is considered as input and is included in the dataset utilized in this framework [24]. UTKFace is an enormous face dataset with a wide age range (0–116 years old).More than 20,000 face images with ethnicity, gender, and age annotations make up the dataset. The photographs exhibit a wide range of variations in terms of clarity, occlusion, lighting, facial expression, and posture. Many tasks, such as age estimation, face detection, landmark localization, age regression/progression, etc., could be performed using this dataset. For the proposed model, consider dataset as 10,136 data that the classifier used to create its best predictions involved age and gender. In this case, 80% (8108) is employed for training and 20% (2028) for testing.

Figure 5 demonstrates Gender and Age Classification using ASMNet based Facial Fiducial Detection and Jordan Neural Network. The first columns of the image in Fig. 5 displays the original image, the 2nd column represents the cropping image. The 3rd column provides the Gabor filter images and the 4th column represents Logarithmic transformation images then final column shows segmented images of facial Fiducial and pose detection.

Table 1 represents the hyperparameters of the Jordan Neural Network to detect the age and gender of the people. Parameter are activation function as tanh, adam optimizer, mean square error loss, epochs as 200 and batch size is 32.

Table 1 Hyperperameters of Jordan neural network

Full size table

The proposed model confusion metrics are shown in Fig. 6. A confusion metre helps visualise the outcomes of different expected results by providing a tabular arrangement for them. It compiles all of the predicted and actual values of a classifier into a table. The total quantity of data used for testing is 2028, of which 1906 are anticipated according to the actual class while the remaining 122 are incorrectly predicted.

The Receiver Operating Characteristic Curve (ROC) for the facial fiducial prediction of face images is shown in Fig. 7. An ROC curve (receiver operating characteristic curve) is a graph showing the performance of a classification model at all classification thresholds.

In Fig. 8, measurements of the Detection rate are used to compare the ASMNet and the existing model. The ASMNet values for the existing model for the MTCNN and PKPCA are 86, 94, 89, 90 and 80, 85, 86, 87 and 83, 81, 77, and 73. As a result, the ASMNet is more perfect when compared to the existing approaches. Figure 9 depicts the Detection failure rate comparison of the ASMNet with the existing techniques. The ASMNet and the existing techniques each have Detection failure rate values of 14, 6, 11, 10 and 20, 15, 14, 13 and 17, 19, 23, 27, respectively. Based on the obtained Detection failure rate values, the model values are better performance than those of the existing model.

Figure 10 illustrates the comparison based on peak signal-to-noise ratio (PSNR), Signal-to-noise ratio (SNR) and structural content for the proposed Optimized Gabor Filter With Logarithmic Transformation (OGF-LT) and existing techniques include Weighted Gradient Filter (WGF), Weiner Filter (WF), and Non Local Mean (NLM) filter. The PSNR for the proposed OGF-LT and existing approaches such as WGF, WF, and NLM are 69, 65, 57, and 52, respectively. Likewise, the SNR and SC values for the proposed and existing are 67,63,58,55 and 61,54,49,43 respectively. The performance measurements of the proposed OGF-LT and the existing techniques are analyzed, and the results obtained show that the proposed method's outcome is well for improving image quality than that of the other approaches.

Figure 11 represents the accuracy, PPV, Hit Rate, selectivity and NPV evaluation between JNN and existing techniques. The JNN is compared with existing methods including Extended Nearest Neighbor (ENN), Support Vector Machine (SVM) and Multilayer Perceptron (MLP). The achieved accuracy for the JNN and the existing methods are 93.0, 86.0, 83.0, and 80.0 and Positive Predictive values (PPV) for the JNN and the existing methods are 87.0, 81.6, 84.0, and 76.0. The obtained Hit rate for the JNN and the existing methods are 89.0, 85.6, 78.0, and 82.0 and the accomplished Selectivity for the JNN and the existing methods are 94.82, 86.0, 81.39, and 88.0. The achieved Negative Predictive Value (NPV) for the JNN and the existing methods are 92.320, 89.3, 82.39, and 78.3. The achieved accuracy, PPV, Hit Rate, selectivity and NPV for the proposed JNN methods are 93%, 87%, 89%, 94.82% and 92.32%. The JNN model is more accurate for detecting age and gender when compared to the existing approaches interms of PPV, accuracy, Hit Rate, selectivity and NPV.

The fall-out, FOR, Miss Rate, FDR and error comparison between JNN and existing approaches is presented in Fig. 12. For the JNN, ENN, SVM and MLP the false omission rate (FOR) and false discovery rate (FDR) values are, respectively, 7.67, 10.70, 17.60, 21.70 and 13.0, 18.40, 16.0, 24.0. The achieved Fall-out for the JNN and the existing methods are 5.17, 14.0, 18.60, and 12.0 and the acquired Miss Rate for the JNN and the existing methods are 11.0, 14.40, 22.0, and 18.0. The obtained error values for JNN, ENN, SVM and MLP are 7.0, 14.0, 17.0, and 20.0. The JNN has less error, fall-out, FOR, Miss Rate, FDR than the existing methods as a result.

Performance indicators such as F1_score, phi coefficient, kappa, MK, and FM are compared between the proposed JNN and the existing models in Fig. 13. The attained F1_score values for JNN, ENN, SVM and MLP are 87.98, 83.55, 80.88, and 78.88 and also Phi coefficient for the JNN and the existing methods are 84.8, 83.7, 76.34, and 79.9.The Kappa values obtained for the JNN and the existing approaches are 88.3, 78.5, 85.6, and 73.6, respectively. The JNN and existing model's markedness (MK) values are 91.3, 88.2, 81.39, and 85.7, respectively and The Fowlkes-Mallows index (FMI) values are 90.60, 82.199, 87.8, and 81.0. JNN model performs better than the existing model.

A JNN and the existing model are compared in Fig. 14 in terms of training time measurements. The JNN and existing models training times are 230.79, 269.32, 291.63 and 323.76 respectively. The JNN training time value is greater than that of the existing model. The testing and execution time of the JNN is compared with the existing methods in Figs. 15 and 16. For the JNN, ENN, SVM and MLP, the testing times attained are 0.88, 1.38, 0.96, and 1.47 s. And in that order, the overall execution of the proposed and existing model are 231.67, 270.7, 292.59 and 325.23. The JNN model overcomes the existing model in testing and execution time. Hence, based on the comparison between the proposed and existing models, the proposed model is found to validate superior performance metrics. Table 2 illustrates the accuracy comparison of different CNN models varying data size.

Table 2 Accuray comparison of different CNN models varying data size

Full size table

Through this analysis it is shown that high rate of accuracy around 91.5% is achieved when the size of the dataset is larger (1.5 Gb) using Inceptionv3. Accuracy varies based on training process and hyperparameter of the models such as number of layers, optimizer and activation function. In case of using smaller dataset though accuracy is minimal computation complexity is low. Thus it represents trained with less dataset makes the classification accuracy less and time consuming is less [29].

5 Conclusion

Gender and age prediction utilizing face images is based on the unique features of each individual. These features can be used for a variety of purposes, including human–machine interaction, access control, forensic work, preventing identity theft or fraud, and identifying individuals in organizations. But earlier age estimation research relied on handcrafted features for encoding age-related patterns. There are numerous approaches and significant literature regarding the subject. However, biological variances and uncertainty will always be linked with age estimates due to the wide range of face appearance and other intrinsic and extrinsic factors. The proposed model regards the facial image as an input. Cropping, logarithmic transformation, optimal Gabor filter and centre surround device normalization are the pre-processing techniques used on these data sets. The face area is clipped from the background using cropping technique. Next, pixel-by-pixel face normalization is achieved using centre surround device normalization. For the purpose of eliminating noise, an optimized Gabor filter is employed, with lyrebird optimization being utilized to determine the orientation value optimally. A logarithmic alteration is applied to the image to improve contrast. ASMNet (Active Shape Model paired with CNN) detects facial fiducial and position based on preprocessed data. Primary facial landmarks including the lips, nose tip, eye, and mouth are detected using this model. Next, EfficientNetB7 is used to extract features, and a Jordan neural network is used for classification in order to classify age and gender. Performance metrics for this designed model include Accuracy, Positive predictive value, Hit rate, Selectivity, NPV, FOR, FDR, Fall-out, Miss-Rate, F1-Score, Error, Phi-coefficient, Kappa, MK, FM, Testing time, Training time and Execution time. The proposed models achieved performance metrics values are 93, 87, 89, 94.82, 92.32, 7.67, 13, 5.17, 11, 87.98, 70, 84.8, 88.3, 91.3, 90.60, 230.79, 0.88 and 231.67 Seconds. These evaluated values are contrasted with the results of existing methods like ENN, SVM and MLP. Gender and Age Classification using ASMNet based Facial Fiducial Detection and Jordan Neural Network is better than the existing model along with that using this prediction technique the possibility of error rate gets reduced and timely detection can be achieved. Future work should focus on using hybrid deep learning techniques to enhance the model and incorporate additional facial images while extracting local features under different circumstances for recognizing research areas such as human emotions and race.

Availability of data and material

Not applicable.

Code availability

Not applicable.

References

Kärkkäinen, K., Joo, J.: Fairface: face attribute dataset for balanced race, gender, and age. arXiv:1908.04913 (2019)
Carletti, V., Greco, A., Percannella, G., Vento, M.: Age from faces in the deep learning revolution. IEEE Trans. Pattern Anal. Mach. Intell.Intell. 42(9), 2113–2132 (2019)
Article Google Scholar
Afifi, M., Abdelhamed, A.: Afif4: deep gender classification based on adaboost-based fusion of isolated facial features and foggy faces. J. Vis. Commun. Image Represent.Commun. Image Represent. 62, 77–86 (2019)
Article Google Scholar
Kosinski, M.: Facial recognition technology can expose political orientation from naturalistic facial images. Sci. Rep. 11(1), 100 (2021)
Article Google Scholar
Jin, B., Xu, X.: Price forecasting through neural networks for crude oil, heating oil, and natural gas. Meas. Energy 1(1), 100001 (2024)
Article Google Scholar
Karthick, S., Muthukumaran, N.: Deep RegNet-150 architecture for single image super resolution of real-time unpaired image data. Applied Soft Computing. 162, 111837 (2024). https://doi.org/10.1016/j.asoc.2024.111837
Article Google Scholar
Savchenko, A.V.: Facial expression and attributes recognition based on multi-task learning of lightweight neural networks. In: 2021 IEEE 19th International Symposium on Intelligent Systems and Informatics (SISY), pp. 119–124. IEEE (2021)
Jin, B., Xu, X.: Forecasting wholesale prices of yellow corn through the Gaussian process regression. Neural Comput. Appl.Comput. Appl. 36(15), 8693–8710 (2024)
Article Google Scholar
Gupta, S., Thakur, K., Kumar, M.: 2D-human face recognition using SIFT and SURF descriptors of face’s feature regions. Vis. Comput.Comput. 37, 447–456 (2021)
Article Google Scholar
Xu, X., Zhang, Y.: Corn cash price forecasting with neural networks. Comput. Electron. Agric.. Electron. Agric. 184, 106120 (2021)
Article Google Scholar
Abbruzzese, L., Magnani, N., Robertson, I.H., Mancuso, M.: Age and gender differences in emotion recognition. Front. Psychol. 10, 2371 (2019)
Article Google Scholar
Tu, X., Zhao, J., Xie, M., Jiang, Z., Balamurugan, A., Luo, Y., et al.: 3D face reconstruction from a single image assisted by 2D face images in the wild. IEEE Trans. Multimedia 23, 1160–1172 (2020)
Article Google Scholar
Modeste, P., Reitano, S.: Facial expression analysis by k-means clustering on fiducial points of face (2019)
Rizwan, S.A., Alsufyani, N., Shorfuzzaman, M., Alarfaj, M., Jalal, A., Kim, K.: Automatic fiducial points detection for multi-facial expressions via invariant features and multi-layer kernel sliding perceptron. J. Electr. Eng. Technol. 18(1), 651–661 (2023)
Article Google Scholar
Chen, L., Su, H., Ji, Q.: Deep structured prediction for facial landmark detection. In: Advances in Neural Information Processing Systems, Vol. 32 (2019)
Duan, M., Li, K., Yang, C., Li, K.: A hybrid deep learning CNN–ELM for age and gender classification. Neurocomputing 275, 448–461 (2018)
Article Google Scholar
Hassan, K.R., Ali, I.H.: Age and gender classification using multiple convolutional neural network. In: IOP Conference Series: Materials Science and Engineering, Vol. 928, No. 3, p. 032039. IOP Publishing (2020)
Khan, K., Attique, M., Syed, I., Sarwar, G., Irfan, M.A., Khan, R.U.: A unified framework for head pose, age and gender classification through end-to-end face segmentation. Entropy 21(7), 647 (2019)
Article MathSciNet Google Scholar
Nada, A.A., Alajrami, E., Al-Saqqa, A.A., Abu-Naser, S.S.: Age and gender prediction and validation through single user images using CNN. Int. J. Acad. Eng. Res. (IJAER) 4, 21–24 (2020)
Google Scholar
Haseena, S., Saroja, S., Madavan, R., Karthick, A., Pant, B., Kifetew, M.: Prediction of the age and gender based on human face images based on deep learning algorithm. Comput. Math. Methods Med.. Math. Methods Med. 2022, 1–16 (2022)
Google Scholar
Ismail, M.K., Al-Ameen, Z.: Adapted single scale Retinex algorithm for nighttime image enhancement. AL-Rafidain J. Comput. Sci. Math. 16(1), 59–69 (2022)
Google Scholar
Dehghani, M., Bektemyssova, G., Montazeri, Z., Shaikemelev, G., Malik, O.P., Dhiman, G.: Lyrebird optimization algorithm: a new bio-inspired metaheuristic algorithm for solving optimization problems. Biomimetics 8(6), 507 (2023)
Article Google Scholar
Munawar, H.S., Aggarwal, R., Qadir, Z., Khan, S.I., Kouzani, A.Z., Mahmud, M.P.: A gabor filter-based protocol for automated image-based building detection. Buildings 11(7), 302 (2021)
Article Google Scholar
Manikpuri, U., Yadav, Y.: Image enhancement through logarithmic transformation. Int. J. (2014)
Bragatto, T., Cresta, M., Gatta, F.M., Geri, A., Maccioni, M., Paulucci, M.: A 3-D nonlinear thermal circuit model of underground MV power cables and their joints. Electr. Power Syst. Res. 173, 112–121 (2019)
Article Google Scholar
Khushi, H.M.T., Masood, T., Jaffar, A., Rashid, M., Akram, S.: Improved multiclass brain tumor detection via customized pretrained EfficientNetB7 model. IEEE Access (2023)
Wu, W., An, S.Y., Guan, P., Huang, D.S., Zhou, B.S.: Time series analysis of human brucellosis in mainland China by using Elman and Jordan recurrent neural networks. BMC Infect. Dis. 19(1), 1–11 (2019)
Article Google Scholar
UTK Face dataset (2017). UTK: https://susanqq.github.io/UTKFace/. Accessed on 26–12–2023.
Dawson, H.L., Dubrule, O., John, C.M.: Impact of dataset size and convolutional neural network architecture on transfer learning for carbonate rock classification. Comput. Geosci.. Geosci. 171, 105284 (2023)
Article Google Scholar

Download references

Funding

The authors declare that no funds, grants, or other support were received during the preparation of this manuscript.

Author information

Authors and Affiliations

Department of Computer Science, Vels Institute of Science, Technology & Advanced Studies (VISTAS), Pallavaram, Chennai, 600117, India
J. Meenakshi & G. Thailambal

Authors

J. Meenakshi
View author publications
You can also search for this author in PubMed Google Scholar
G. Thailambal
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

The corresponding author claims the major contribution of the paper including formulation, analysis and editing. The co-authors provides guidance to verify the analysis result and manuscript editing.

Corresponding author

Correspondence to J. Meenakshi.

Ethics declarations

Conflict of interest

The authors declared that they have no conflicts of interest to this work. We declare that we do not have any commercial or associative interest that represents a conflict of interest in connection with the work submitted.

Ethical standards

This article is a completely original work of its authors; it has not been published before and will not be sent to other publications until the journal’s editorial board decides not to accept it for publication.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Meenakshi, J., Thailambal, G. Gender and age classification using ASMNet based facial fiducial detection and Jordan neural network. Prog Artif Intell 13, 293–306 (2024). https://doi.org/10.1007/s13748-024-00336-x

Download citation

Received: 06 February 2024
Accepted: 04 August 2024
Published: 29 August 2024
Issue Date: December 2024
DOI: https://doi.org/10.1007/s13748-024-00336-x

Keywords

Use our pre-submission checklist
使用我们的投稿清单

Avoid common mistakes on your manuscript.
避免在您的稿件中犯常见错误。

Abstract 摘要
1 Introduction 引言
2 Literature review 文献综述
3 Proposed methodology 提出的方法
4 Result and discussion 结果与讨论
5 Conclusion 结论
Availability of data and material
数据与材料可用性
Code availability 代码可用性
References 参考文献
Funding 资助
Author information 作者信息
Ethics declarations 伦理声明
Additional information 附加信息
Rights and permissions 权利与许可
About this article 关于这篇文章

Fig. 1
View in article Full size image
Fig. 2
View in article Full size image
Fig. 3
View in article Full size image
Fig. 4
View in article Full size image
Fig. 5
View in article Full size image
Fig. 6
View in article Full size image
Fig. 7
View in article Full size image
Fig. 8
View in article Full size image
Fig. 9
View in article Full size image
Fig. 10
View in article Full size image
Fig. 11
View in article Full size image
Fig. 12
View in article Full size image
Fig. 13
View in article Full size image
Fig. 14
View in article Full size image
Fig. 15
View in article Full size image
Fig. 16
View in article Full size image

Kärkkäinen, K., Joo, J.: Fairface: face attribute dataset for balanced race, gender, and age. arXiv:1908.04913 (2019)
Carletti, V., Greco, A., Percannella, G., Vento, M.: Age from faces in the deep learning revolution. IEEE Trans. Pattern Anal. Mach. Intell.Intell. 42(9), 2113–2132 (2019)
Article Google Scholar
Afifi, M., Abdelhamed, A.: Afif4: deep gender classification based on adaboost-based fusion of isolated facial features and foggy faces. J. Vis. Commun. Image Represent.Commun. Image Represent. 62, 77–86 (2019)
Article Google Scholar
Kosinski, M.: Facial recognition technology can expose political orientation from naturalistic facial images. Sci. Rep. 11(1), 100 (2021)
Article Google Scholar
Jin, B., Xu, X.: Price forecasting through neural networks for crude oil, heating oil, and natural gas. Meas. Energy 1(1), 100001 (2024)
Article Google Scholar
Karthick, S., Muthukumaran, N.: Deep RegNet-150 architecture for single image super resolution of real-time unpaired image data. Applied Soft Computing. 162, 111837 (2024). https://doi.org/10.1016/j.asoc.2024.111837
Article Google Scholar
Savchenko, A.V.: Facial expression and attributes recognition based on multi-task learning of lightweight neural networks. In: 2021 IEEE 19th International Symposium on Intelligent Systems and Informatics (SISY), pp. 119–124. IEEE (2021)
Jin, B., Xu, X.: Forecasting wholesale prices of yellow corn through the Gaussian process regression. Neural Comput. Appl.Comput. Appl. 36(15), 8693–8710 (2024)
Article Google Scholar
Gupta, S., Thakur, K., Kumar, M.: 2D-human face recognition using SIFT and SURF descriptors of face’s feature regions. Vis. Comput.Comput. 37, 447–456 (2021)
Article Google Scholar
Xu, X., Zhang, Y.: Corn cash price forecasting with neural networks. Comput. Electron. Agric.. Electron. Agric. 184, 106120 (2021)
Article Google Scholar
Abbruzzese, L., Magnani, N., Robertson, I.H., Mancuso, M.: Age and gender differences in emotion recognition. Front. Psychol. 10, 2371 (2019)
Article Google Scholar
Tu, X., Zhao, J., Xie, M., Jiang, Z., Balamurugan, A., Luo, Y., et al.: 3D face reconstruction from a single image assisted by 2D face images in the wild. IEEE Trans. Multimedia 23, 1160–1172 (2020)
Article Google Scholar
Modeste, P., Reitano, S.: Facial expression analysis by k-means clustering on fiducial points of face (2019)
Rizwan, S.A., Alsufyani, N., Shorfuzzaman, M., Alarfaj, M., Jalal, A., Kim, K.: Automatic fiducial points detection for multi-facial expressions via invariant features and multi-layer kernel sliding perceptron. J. Electr. Eng. Technol. 18(1), 651–661 (2023)
Article Google Scholar
Chen, L., Su, H., Ji, Q.: Deep structured prediction for facial landmark detection. In: Advances in Neural Information Processing Systems, Vol. 32 (2019)
Duan, M., Li, K., Yang, C., Li, K.: A hybrid deep learning CNN–ELM for age and gender classification. Neurocomputing 275, 448–461 (2018)
Article Google Scholar
Hassan, K.R., Ali, I.H.: Age and gender classification using multiple convolutional neural network. In: IOP Conference Series: Materials Science and Engineering, Vol. 928, No. 3, p. 032039. IOP Publishing (2020)
Khan, K., Attique, M., Syed, I., Sarwar, G., Irfan, M.A., Khan, R.U.: A unified framework for head pose, age and gender classification through end-to-end face segmentation. Entropy 21(7), 647 (2019)
Article MathSciNet Google Scholar
Nada, A.A., Alajrami, E., Al-Saqqa, A.A., Abu-Naser, S.S.: Age and gender prediction and validation through single user images using CNN. Int. J. Acad. Eng. Res. (IJAER) 4, 21–24 (2020)
Google Scholar
Haseena, S., Saroja, S., Madavan, R., Karthick, A., Pant, B., Kifetew, M.: Prediction of the age and gender based on human face images based on deep learning algorithm. Comput. Math. Methods Med.. Math. Methods Med. 2022, 1–16 (2022)
Google Scholar
Ismail, M.K., Al-Ameen, Z.: Adapted single scale Retinex algorithm for nighttime image enhancement. AL-Rafidain J. Comput. Sci. Math. 16(1), 59–69 (2022)
Google Scholar
Dehghani, M., Bektemyssova, G., Montazeri, Z., Shaikemelev, G., Malik, O.P., Dhiman, G.: Lyrebird optimization algorithm: a new bio-inspired metaheuristic algorithm for solving optimization problems. Biomimetics 8(6), 507 (2023)
Article Google Scholar
Munawar, H.S., Aggarwal, R., Qadir, Z., Khan, S.I., Kouzani, A.Z., Mahmud, M.P.: A gabor filter-based protocol for automated image-based building detection. Buildings 11(7), 302 (2021)
Article Google Scholar
Manikpuri, U., Yadav, Y.: Image enhancement through logarithmic transformation. Int. J. (2014)
Bragatto, T., Cresta, M., Gatta, F.M., Geri, A., Maccioni, M., Paulucci, M.: A 3-D nonlinear thermal circuit model of underground MV power cables and their joints. Electr. Power Syst. Res. 173, 112–121 (2019)
Article Google Scholar
Khushi, H.M.T., Masood, T., Jaffar, A., Rashid, M., Akram, S.: Improved multiclass brain tumor detection via customized pretrained EfficientNetB7 model. IEEE Access (2023)
Wu, W., An, S.Y., Guan, P., Huang, D.S., Zhou, B.S.: Time series analysis of human brucellosis in mainland China by using Elman and Jordan recurrent neural networks. BMC Infect. Dis. 19(1), 1–11 (2019)
Article Google Scholar
UTK Face dataset (2017). UTK: https://susanqq.github.io/UTKFace/. Accessed on 26–12–2023.
Dawson, H.L., Dubrule, O., John, C.M.: Impact of dataset size and convolutional neural network architecture on transfer learning for carbonate rock classification. Comput. Geosci.. Geosci. 171, 105284 (2023)
Article Google Scholar

Navigation

Gender and age classification using ASMNet based facial fiducial detection and Jordan neural network
基于 ASMNet 的 facial fiducial 检测和 Jordan 神经网络进行性别和年龄分类

Abstract 摘要

Similar content being viewed by others
其他人正在查看的内容

Face Detection and Facial Feature Extraction with Machine Learning
人脸检测与面部特征提取基于机器学习

A Comparative Analysis of Recent Face Detection Methods Implemented for Age and Gender Detection
《近期用于年龄和性别检测的人脸检测方法的比较分析》

Neural networks for facial age estimation: a survey on recent advances
神经网络在人脸年龄估计中的应用：近期进展综述

1 Introduction 1 引言

2 Literature review

3 Proposed methodology

3.1 Pre-processing

3.1.1 Cropping

3.1.2 Center surround device normalization

3.1.3 Optimized gabor filter

3.1.4 Logarithmic transformation

3.1.5 ASM network

3.2 Feature extraction

3.2.1 EfficientNetB7

3.3 Classification

4 Result and discussion

5 Conclusion

Availability of data and material

Code availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Ethical standards

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Search

Navigation

Gender and age classification using ASMNet based facial fiducial detection and Jordan neural network基于 ASMNet 的 facial fiducial 检测和 Jordan 神经网络进行性别和年龄分类

Abstract 摘要

Similar content being viewed by others其他人正在查看的内容

Face Detection and Facial Feature Extraction with Machine Learning 人脸检测与面部特征提取基于机器学习

A Comparative Analysis of Recent Face Detection Methods Implemented for Age and Gender Detection 《近期用于年龄和性别检测的人脸检测方法的比较分析》

Neural networks for facial age estimation: a survey on recent advances 神经网络在人脸年龄估计中的应用：近期进展综述

Explore related subjects探索相关主题

1 Introduction 1 引言

2 Literature review

3 Proposed methodology

3.1 Pre-processing

3.1.1 Cropping

3.1.2 Center surround device normalization

3.1.3 Optimized gabor filter

3.1.4 Logarithmic transformation

3.1.5 ASM network

3.2 Feature extraction

3.2.1 EfficientNetB7

3.3 Classification

4 Result and discussion

5 Conclusion

Availability of data and material

Code availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Ethical standards

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Gender and age classification using ASMNet based facial fiducial detection and Jordan neural network
基于 ASMNet 的 facial fiducial 检测和 Jordan 神经网络进行性别和年龄分类

Similar content being viewed by others
其他人正在查看的内容

Face Detection and Facial Feature Extraction with Machine Learning
人脸检测与面部特征提取基于机器学习

A Comparative Analysis of Recent Face Detection Methods Implemented for Age and Gender Detection
《近期用于年龄和性别检测的人脸检测方法的比较分析》

Neural networks for facial age estimation: a survey on recent advances
神经网络在人脸年龄估计中的应用：近期进展综述

Explore related subjects
探索相关主题