Abstract
Video Processing has found enormous applications in recent times from security systems to interplanetary missions. In real-life situations, most of the videos are fuzzy/vague/uncertain. Intuitionistic fuzzy set (IFS) is one of the effective tools for handling uncertainty. Among many extensions of IFSs, temporal intuitionistic fuzzy sets (TIFSs) are very interesting as they are time-dependent. Hence, TIFSs are suitable to define a video, which is dynamic and hence depends on time-moment. In this way, this work introduces a novel VIdeo PROCessing (VIPROC) algorithm, using temporal intuitionistic fuzzy sets to enhance videos, which is first of its kind in existence. The comparison is made with fuzzy contrast intensification operation. VIPROC algorithm is designed using contrast intensification operation for video enhancement. The results are encouraging in comparison with the original test videos. The results are discussed taking into account the several frames of the test video. Further, the proposed algorithm can be applied/extended to engineering applications like motion tracking, traffic detection systems, real time videos captured through mobile (hand-held) devices, and so on. As no such algorithms are existing which use TIFSs to process a video, the authors got motivated to design and develop VIPROC algorithm.
Keywords
Introduction
The set theory devised by George Cantor is the foundation of modern mathematics and definitions of the most fundamental mathematical terms are based on sets. A set is a well-defined collection of objects. Here, well-defined means that there is a definite method to determine whether an object belongs to the set. Generally, a set is described by a two-valued function called characteristic function.
The real-world situations are often associated with built-in uncertainties. Hence, fuzzy sets, introduced by Lotfi A Zadeh –blurred, vague, indistinct quantities –this means that sets and their surroundings, their complement within the universe of discourse, are connected. In the usual diagrammatic notation of sets as circles, fuzzy sets appear to be sets with indistinct edges. There are objects at the boundaries of the circle that belongs to the fuzzy set and its complement, and hence the complement of a fuzzy set is again a fuzzy set [44]. A fuzzy set is characterized by its membership function. The membership grade in a fuzzy set is not a matter of affirmation or denial, but rather a matter of degree. In fuzzy set theory, the membership of an element to a fuzzy set is a single value between zero and one.
Due to the insufficiency in the availability of information, the evaluation of membership values is sometimes not possible up to our satisfaction. Therefore a generalization of fuzzy sets was introduced by Krassimir T Atanassov in 1983 as intuitionistic fuzzy sets (IFSs), which include both membership and non-membership of the element in the set, where the non-membership value equals one minus the membership value. However, in reality, it may not exist, because there remains a part indeterministic on which hesitation survives. This value is called the intuitionistic fuzzy index (hesitancy index). In such situations, the theory of IFS seems to be applicable to address this issue of uncertainty. In the case, when the degree of rejection is defined simultaneously with the degree of acceptance and when both these degrees are not complementary to each other, then IFS can be used as a more general tool for describing uncertainty. The authors got motivated from the following few open problems on the intuitionistic fuzzy set (IFS) and its extensions available in [24].
i. To investigate the properties of IFS in the field of image processing, pattern recognition, networks for finding the shortest route.
ii. Develop efficient algorithms for the construction of the degree of membership and non-membership of a given IFSs.
iii. To develop algorithms for defuzzification and comparison of IFSs.
Hence, an attempt has been made to work on these three problems to provide an IF logic tool for video processing.
Video processing tends to have uncertainties that vary each second. This tempts to propose the ‘temporal’ concept for the IFS. Video processing is a subset of image processing where the video is dissolved as frames and processed and again assembled to the output as a video. The process of dissolving and assembling takes time and hence many video processing algorithms tend to be slow and deviate in terms of accuracy. The video consists of a set of images/frames. The image/frame is defined with the help of pixels. TIFS are introduced in [27].
The role of contrast intensification on sets, fuzzy sets, IFSs and TIFSs in image and video processing is discussed, in detail, in the forthcoming sections of this paper. In section 2, review of literature is given. In Section 3, the basic concepts of intuitionistic fuzzy sets and a few operations over them are defined. Also, image representation using sets, fuzzy sets and intuitionistic fuzzy sets are discussed. Section 4 deals with operations like
Literature review
Melo-pinto et al. described a thresholding technique using Atanassov’s intuitionistic fuzzy sets (A-IFSs). This approach uses Atanassov’s intuitionistic index values for representing the hesitance of the expert in determining whether the pixel belongs to the background or to the object. Segmentation experimental results and comparison with Otsu’s multilevel thresholding algorithm for the calculation of two and three thresholds are presented by them [34].
A new distance measure called intuitionistic fuzzy divergence, has been proposed and edge detection is carried out in [47].
In [16], the issue of moving object detection using a background subtraction technique is addressed. There are complex conditions in the real world video sequences that show multi-modal distributions of a scene. These multi-modal distributions cannot be modeled by just one feature. So, different image features should be extracted and fused to handle such conditions. The extracted features should be complement to each other. Hence, to efficiently model these multi-modal distributions, the authors propose fusing color and texture features using Choquet fuzzy integral. To capture spatial information, the spatial color features are extracted by Atanassov’s Intuitionistic Adaptive 3D Fuzzy Histon Rough-ness Index. Additionally, some texture features are extracted by a covariance matrix. Moreover, the authors proposed a novel model updating method than can cope with a number of challenges such as model initializing with moving objects, fast/slow moving object detection, and the moving objects that stop for a while. The proposed method is evaluated on a number of benchmark video sequences representing various real-world challenges. Results demonstrated that the proposed method can outperform the state-of-the-art methods in most of the experimented scenarios and challenges.
Pagola et al. present a valuable review of the recent works using extensions of fuzzy sets in image processing. The basics of the extensions of fuzzy sets are recalled. That is, type-2 fuzzy sets, interval-valued fuzzy sets and Atanassov’s IFSs. Also, a review on the proposed methods for noise removal, image enhancement, edge detection and segmentation is made [37].
In [21], a new construction method of IVFSs from fuzzy sets is presented. Also, the authors use these IVFSs for image processing. Concretely, in this contribution they introduced a new image magnification algorithm using IVFSs. This algorithm is based on block expansion and it is characterized by its simplicity.
Tizhoosh’s algorithm makes it possible to obtain additional information from commonly used fuzzy algorithms. In [3], the authors proved that Tizhoosh’s algorithm performs better than area algorithm. Therefore, the use of Tizhoosh’s algorithm in image processing and the use of interval-valued fuzzy sets is justified.
Research on video processing is a continuously explored area that is still finding immense applications in diverse domains. But its goal remains unchanged: Extraction of the desired target from the image frame is considered. There has been a huge increase in Internet Protocol (IP) based digital cameras from analog cameras. It has been estimated that 25% to 50% of the cameras have video analysis processors built in them. Some video analysis techniques such as extraction of features and categorization of objects are used for the desired target. Another important technique is pre-processing which results in more accuracy while analyzing the video in further stages of the processing [42].
Two-dimensional textures were studied to some extent in the past. Yet, there is relatively very less research and results on the three-dimensional dynamic textures that express certain time-varying properties in the video processing. In many frames, the regions have important structural similarities and exhibit high temporal correlations between image frames extracted from the actual video. The dynamic textures draw some disadvantages in motion detection and analysis. With the use of segmentation and exclusion of the dynamic textures in the image frames, the robust nature of the motion detection and action identification is improved [7]. The study introduces an optical flow-based to represent temporal video information. This type of approach is suited for the segmentation of the temporal video. This problem can be converted as a cut detection problem since it is the phase I of the content-based video information retrieval. In the first step, optical flow vectors are calculated and this representation is employed for the cut detection. The important aspect of the proposed idea is the way it solves the temporal video representation problem in a video information retrieval domain. Adding to the established results, this proposed way of representation acts as a solution for the dimensionality problem for the video data [44].
The main contents in the surveillance include the detection of human movements and the estimation of the population density and their tracking [19]. The study aims to extract and analyze different types of extraction of features from the video frames [10]. But, the problem of objective perspective deformation has more influence on the detection accuracy in all kinds of video monitoring analysis. Solving the problem of objective perspective deformation on the video feature extraction and analysis is an important aspect that has to be studied in several video analysis algorithms. The work proposes a perspective normalization weights that are compared with the linear perspective weights of the existing literature that is used in the moving object detection processing algorithms in the later stages. The results reveal that this method can solve the impact of the perspective distortion concerning the accuracy of the video analysis [2]. The recent developments in video processing that uses deep learning primitives have given way to several breakthroughs in the basic and important issues of video analysis such as classification of frames and detection of objects. These developments give way to a new list of applications. More particularly, Nick et al.introduce a set of approximate filters the speeds up issues involving the target objects in the particular video frame and their spatial relations. These filters can access fast if the issue predictions are true to proceed with more analysis for the frame or else leave for the next frame with minimizes the cost of the operations for the present frame. The work has proposed a set of filters for the estimation of the number of objects in a frame, the number of objects of a specific class in a frame as well as to assess an estimate of the spatial position of an object in a frame enabling us to reason out spatial constraints. These filters are ranked based on accuracy and their high accuracy in counting and their use for location estimation are proved experimentally with the help of real video data [36].
Based on fuzzy set modelling and fuzzy intensification operator, an implementation of contrast enhancement in the compressed domain is presented [13]. A Gaussian membership function to model image information in spatial domain has been proposed in [18]. Three enhancement techniques namely fuzzy rule based contrast enhancement, contrast enhancement using intensification (INT) operator, and contrast enhancement using fuzzy expected value (FEV) are presented for the low contrast grayscale images [45]. The contrast of Flat Electroencephalography (fEEG) image is enhanced by using a fuzzy approach. The fEEG image itself is a fuzzy object which is in grayscale. It is originated from a technique known as fEEG which mapped high dimensional signal into low dimensional space [46]. Detailed image analysis has been done using different qualitative attributes of the image in the simulation. The results show better enhancement in image contrast. It can be obtained by taking a different membership function for transformation of membership values in the fuzzy domain [32].
The contrast and sharpness of the images captured under water will be significantly deteriorated and diminished due to the low perceptibility of the image caused by the physical properties of water as a medium. An improved version of CLAHE mechanism is used to enhance color images by using dynamic histogram equalization technique [29]. In real-life situations, this type of document occurs as handwritten/printed text and historical documents. Contrast intensification operator on IFSs can be used to improve the quality of such degraded images [20]. Acquiring human gait data can be erroneous due to various factors and multimodal fusion of such erroneous gait data can be challenging. This work proposed a new decision fusion-based approach to solve the above problem. Gait data is recorded simultaneously using motion sensors and a visible light camera. The signals of the motion sensors are modelled using a Long Short Term Memory (LSTM) neural network and corresponding video recordings are processed using a 3D Convolutional Neural Network (CNN). GWO has been used to optimize the parameters during fusion. It has been chosen based on the underlying hunting strategy that leads to a better approximation of the solution and interestingly, in our case it converges quicker than other optimization techniques such as Genetic Algorithm (GA) or Particle Swarm Optimization (PSO) [29].
Object tracking from video sequences, although well researched, still faces difficulties under certain challenging situations. The work proposed a new particle filter-based tracking, where the fuzzy observation model exploits spatial correlation in a rough set-theoretic framework. It further improved certain critical steps of the particle filter such as mean state estimation and resampling [8]. A novel concept, namely, intuitionsistic entropy is introduced in [6] which consists of two new measures: neighborhood rough entropy and neighborhood probabilistic entropy to deal with the ambiguities that arise due to occurrence of overlapping/occlusion in a video sequence.
In [31], a novel system design framework for dynamic, data-driven processing of multispectral video streams using lightweight dataflow (LD) techniques is introduced. The authors of [40] made a survey on maritime object detection and tracking approaches are presented. In [9], an ensemble of three video processing techniques are benchmarked against each other and tested as viable options for the non-contact dynamic characterisation of slender beam-like structures. A novel 3D fuzzy method for the denoising of color videos corrupted by additive noise is presented [48]. Intuitionistic logic based on block matching techniques has been used to strongly and proficiently detect abandoned objects based on background subtraction is presented [41].
In a nutshell, the following are the two main reasons to introduce the proposed algorithm. (i) After making a thorough study/survey on video processing techniques/algorithms, it is observed that temporal IFS theory is not applied to design and develop an algorithm for processing videos. (ii) The authors are working on image processing and made a notable contribution in image processing using IFS theory. Therefore, the authors are interested to apply temporal IFS theory in video processing.
In this paper, the proposed VIPROC Algorithm is meant for contrast enhancement of a video using contrast intensification operator on TIFSs and this approach is first of its method.
Image representation using sets, FSs and IFSs
In this section, an overview is given on geometric interpretation of Sets, FSs and IFSs. In addition, the mathematical representation of images using Sets, FSs and IFSs are also discussed.
A set
The mapping may be represented as a set of ordered pairs {(

Sets.
Let

Fuzzy Sets.
Let

First geometrical interpretation of an IFS
For a clear understanding of intuitionistic fuzzy set, other similar diagrammatic representations are given in Figs. 4 and 5.

An analogue of the first geometrical interpretation of an IFS

Second geometrical interpretation of an IFS
An ordinary fuzzy set can also be written as
A monochrome image, or simply, image (having various shades of gray) is represented mathematically by a spatial brightness function
whose row and column indices identify a point (
Example:
In recent years, many researchers have applied the theories of fuzzy sets and intuitionistic fuzzy sets and their extensions to develop new techniques for contrast improvement.
As discussed earlier, a crisp set
Figure 6 depicts a binary image (white is assigned ‘0’ and black is assigned ‘1’).

Binary Image.
A fuzzy set,
Figure 7 shows a gray image scale with ‘0’ for white, ‘1’ for black and between ‘0’ and ‘1’ for gray.

Grayscale Image
An intuitionistic fuzzy set,
Hence, it is noteworthy that a RGB image can be represented by an IFS, by taking three functions to define three colors. Therefore, a RGB image
This representation is given in Figs. 8 and 9.

RGB Color map [19].

Equilateral triangle geometrical interpretation of an IFS
In this section, the basic concepts of intuitionistic fuzzy sets and operations like
Let
Define the following operations on A and B:
i.
ii.
iii.
Further, the operation contrast intensification has the effect of altering the intuitionistic fuzziness in an intuitionistic fuzzy set
Contrast intensification, as its name implies, reduces the fuzziness of
The
The
Temporal IFSs and their operations
In the previous section, IFSs and their role in image processing are dealt with. This section is dedicated to TIFSs and their operations. TIFSs give a possibility to trace the changes of the object considered for all the time moments from a time scale and permit more detailed estimations of the real time processes flowing in time.
Let
(1)
(2) 0 ≤ μ
(3) μ
The geometrical representation of TIFS is given in Fig. 10.

Geometrical interpretation of TIFS
Hence, it is very important to note from Fig. 10 that each triangle represents a frame of a video. The operations on TIFSs are useful to enhance the quality of a video. They are defined as follows.
Let
Define the following operations on
(1)
(2)
(3)
The
where μ
The design of the traditional logic controller [34] usually requires a mathematical model of the process involved. The construction of such a model is difficult for many real-world problems due to partial or unreliable information. The imprecise description of the problem can be handled as an alternative approach by expert human operations. This modeling leads to the usage of fuzzy concepts that are close to human perception than the traditional logical system. The basic structure of an intuitionistic fuzzy inference system is shown in Fig. 11.

Structure of an intuitionistic fuzzy inference system.
The system includes the following components.
(1)
(2)
(3)
(1) If any of the defined fuzzification functions is not found suitable for the specific problem, the user can define a required fuzzification function, based on the requirement.
(2) Unlike defuzzification to a single value in the fuzzy controller, intuitionistic defuzzification gives a matrix of defuzzified values in [0,1], corresponding to the gray levels of the given image/video.
The following notations are used in the proposed VIPROC Algorithm:
ɛ degree of hesitancy
μ
μ
μ
Algorithm 1 VIPROC Algorithm
Step 1: Read the input video in MATLAB Environment and split the video into frames using MATLAB built-in functions,
Step 2: Using intuitionistic fuzzification functions, fuzzify the gray matrix of the frame(s)
While
If
else If
else If
Else
While
If
Else 1 - ɛ
Step 3: Apply the intuitionistic contrast intensification operator to the fuzzified gray matrix of the frame(s)
μ
Step 4: Defuzzify the resultant gray matrix of the frame(s) using intuitionistic defuzzification functions
While
else If
else
Step 5: Merge the output frames by using MATLAB built-in functions and view the output video
Implementation
The algorithm is developed and implemented using MATLAB 2018b.
To verify the algorithm, a test video of size “240 × 320 × 3” running for “10” seconds is taken for processing. There are totally 159 frames in the video. It is read frame by frame. For the implementation process,

Input
After reading the video,
Intuitionistic fuzzy membership and non-membership values are calculated for the matrix in (2) and displayed as follows in (3) and (4) respectively.
Contrast intensified membership (μ
After defuzzification, the new gray level (
The frame corresponding to the new gray level matrix obtained in Step 4 is depicted as output frame given in Fig. 13.

Output
The input frames 1, 25, 50, 75, 100, 125, 150, 159 are shown along with the corresponding histograms in Figs. 14 and 15.

Input

Input
The output frames 1, 25, 50, 75, 100, 125, 150, 159 are shown along with the corresponding histograms in Figs. 16 and 17.

Output

Output
In Figs. 15 and 17, though the histograms look similar,they are not so. The justification can be made with a reference to image similarities. For example, in Fig. 17, the pixel difference of frame 1 and frame 25 is 2798 pixels. This value is too less compared to the total number of pixels of 230,400. This contributes only 1.2% of the total which is very less and so, it cannot be visualised in the histogram. Similarly in Fig. 15, the pixel difference between frame 1 and frame 25 is 5179 pixels which contributes only 2.24% of the total number of pixels of 230,400. Hence, the difference is minimal.
In this section, the comparison is made with fuzzy video processing and the proposed algorithm. A real-time video of size “720 × 1280” running for “12” seconds is taken for processing. There are a total of 295 frames in the video. Input frames and fuzzy contrast intensified output frames and our proposed algorithm’s output frames of this video are displayed in Fig. 18.

Input frames, Fuzzy contrast intensified image and the proposed algorithm’s output.

Input frames, Fuzzy contrast intensified image and the proposed algorithm’s output histogram.
Another medical scan video [49, 50] of size “1080 × 1920” running for “3” seconds is taken for processing. There are a total of 90 frames in the video. It is read frame by frame. Input frames and fuzzy contrast intensified output frames and our proposed algorithm’s output frames of this video is displayed in Fig. 20. Also histograms of these frames are displayed in Fig. 21.

Input frames, Fuzzy contrast intensified image and the proposed algorithm’s output.

Input frames, Fuzzy contrast intensified image and the proposed algorithm’s output
VIPROC algorithm is very useful in real-time and medical video processing to enhance the video quality. The proposed algorithm gives better result than the fuzzy video processing technique which is demonstrated in this section.
Temporal intuitionistic fuzzy mathematical tools offer a new and flexible framework for designing and developing video enhancement algorithms. It is shown that the contrast intensification operator plays an important role in improving the quality of a video. The following are the major tasks done in this research work: (a) Representation of a video by a TIFS, (b) Video Processing (VIPROC) algorithm is designed and developed in MATLAB environment. The following are the ways by which the developed system is significant in video processing: (a) First attempt to use IFSs to represent RGB images and TIFSs to represent videos, (b) The intuitionistic fuzzy inference system is designed for the specific application, (c) A newly constructed defuzzification function is used. Further, it is proposed to analyze color videos without converting them into gray videos. For, it is necessary to extend this algorithm for red, green, blue separately and then be concatenated, which needs a vigorous parameter tuning. Also, the authors proposed to modify and improve the VIPROC algorithm, using the mathematical operations like dilation and erosion to improve the quality of a video, in their future work.
Footnotes
Acknowledgment
This work was funded by the Chinese National Natural Science Foundation under Grant 61876047.
