A mobile sensing method to counteract social media website impersonation

Abstract

Phishing is a serious threat to online users, especially since attackers have tremendously improved their techniques in impersonating important websites. With websites looking visually the same, users are fooled more easily. Visual similarity algorithms may help to detect and counteract some phished websites. Through similarity algorithms, the phishers play with the colors and visual properties of the website in a way that cannot be noticed by the users. However, the phishers make the unnoticed changes to fool the similarity algorithms as well. In this article, we propose an efficient phishing website detection algorithm using three-step checking. The performance results are compared to the state-of-the-art approaches that show new kinds of phishing warnings with better outcomes and less false positives. Our approach provides similar accuracy to the blacklisting methods with the advantage that it can easily classify the phishing websites with less overhead and without being victimized.

Keywords

Phishing detection website impersonation pixel-by-pixel blacklist whitelist extensible markup language phishing mobile sensing visual similarity

Introduction

Phishing attacks have affected many users through impersonating important official websites and, as a result, stealing personal information or financial data. To the help, visual similarity can be used to counteract phishing. Phishing attack is an undesirable common story. For instance, when someone follows a link to a website purporting to be his favorite social media website, he might get at a web page that looks like his favorite social media website. He finds the same logo he used to see at the social media. This phished website he visits was just published in the Internet within hours or days. As a result, it is most probably unknown to blacklists. Trusting the phished website, the visitor provides his credentials to the phisher. Perhaps, if the visitor knows that he should check for the domain name and indicators of a valid secure sockets layer (SSL) connection to ensure the integrity of the website, it would be fine, but most users are not aware of how to do this. Currently, some users use phishing detection tools and the browsers give various indications of a site’s authenticity. However, they only work if the phished website is recorded in the blacklist to assist the tools in detecting the phishing.

In this article, we introduce a new algorithm designed to overcome the phishing attacks. We explore some experiments on visual memory of the human to find how the person memorizes the website information when he comes back to it after a period. We try to simulate the user’s memory ability to help in strengthening the credential of the website in an implicit way to the user. We design our new model based on these experiments which is unprecedented and based on human behavior. This article is organized as follows. In section “Related work,” we show the related work. In section “Evaluating criteria,” we list the criteria to measure the performance of our new algorithm. Section “Experiments” discusses several experiments and facts that assist us in building our new model. Section “Result of experiments” shows the result of experiments and how to use them in our new model. In section “Our model,” we detailed our new model and its structure. In section “The superiority of our model,” we demonstrate the superiority of our model over the other comparable famous models and algorithms. Finally, section “Conclusion and future work” concludes the article and shows the future enhancement of this work.

Related work

Detecting phishing websites using visual similarity comparison has been proposed in several papers. Wenyin et al.¹ presented a concept that uses three types of similarity to detect phishing websites: “block level,” “layout,” and “total style similarity.” Medvet et al.² proposed a system that calculates a website signature using three features: a web page seen text sections, embedded images, and the overall visual appearance. Signatures can then be compared with the other signatures. They test their algorithm against a set of 140 phishing websites and 27 real websites performing very well. Chen et al.³ used the rendered web page as input to a normalized compression distance compressor. They test that on a set of 320 phishing websites that target 16 different real banking sites, their work shows that phishing websites rated significantly closer to their originals.

Machine-learning techniques can also be used to detect phishing. By converting the content of website⁴ or uniform resource locator (URL) and domain properties into a set of features or feature vectors, machine learning can look for websites that are similar, but having anomalous properties, such as “right” content in the “wrong” place.

Computer vision techniques⁵ can also be used to visually match the images on visited webpages with the originals. While these techniques can detect new phish, their approximate matching risks many false positives, and their high computational requirements make them difficult to run on clients. We instead employ precise content matching using some other techniques, such as cryptographic hashing to avoid false positives and to provide lightweight detection that can run in a client end like browser without centralized support.

All the related works^6–14 show the effect of phishing and the need to counterattack this security threat. In fact, detecting phishing websites through visual similarity works well in general. With our work, we further elaborate the idea by finding more efficient and faster detection algorithm through developing a three-step checking algorithm.

Evaluating criteria

In this section, we show how to evaluate the efficiency of phishing detection algorithms. Then, we apply this evaluation in our new algorithm in comparison to other algorithms. To show the big picture, let us suppose there is a phishing detection algorithm X. After X checks a website, it will give us a decision about the website. This decision can be any of four possible results for any tested website as described in Table 1.

Table 1.

Four status for the tested algorithm.

ID	A	B	Description	Status	Need
1	T	T	It is phishing and the algorithm says it is phishing	T+	Maxims
2	F	T	It is not phishing and the algorithm says it is phishing	F+	Minimis
3	T	F	It is phishing and the algorithm says it is not phishing	T−	Minimis
4	F	F	It is not phishing and the algorithm says it is Not phishing	F−	Maxims

A: the website is phishing; B: the algorithm says that the website is phishing.

The first and fourth possibilities should be maximized by X as much as possible and should minimize the second and third possibilities as much as possible. Let us call the first and fourth possibilities as “desirable opportunity,” while the second and third possibilities as “undesirable possibility.” We can check the rate of each set to measure the efficiency of the phishing algorithm.

A common mistake we have to care while evaluating the algorithms is, we should need to check not only the possible scenarios but also the website being phished. As a result, we have to run the algorithm on a mixed group of phished and non-phished websites. By doing this, we can test the four possibilities. In reality, we do not know whether the website is phished or not; furthermore, if it is phished, we do not know which website the phisher tries to impersonate.

Continuously, the owner of the website is updating it. Wherefore, we cannot rely on taking a snapshot of the real website and save it in our database to recognize the trusted website. Some solutions may take a snapshot of the actual website in the same moment of testing. It seems fine, but this solution is impossible and impractical because we are not aware of the actual website that the phisher tries to impersonate at the real time of testing. The answer to this question is also the answer to the question of whether the website is phished or not as depicted in Figure 1.

Figure 1.

The relation between two issues: A. Is it phishing or not and B. If it is phishing what is the original website.

Figure 2 describes the increasing amount of changes in the real website and how this affects the user behavior. These changes are done by the phisher to deceive the algorithm. These changes mostly unnoticed by the user and at the same time should deceive the detection algorithms to achieve the phisher goal. In the most left part of the arrow, there is not any change in the website, but just the URL. In that point, the user may ultimately be fooled and cheated, but any simple algorithm, such as pixel-by-pixel algorithms, can detect the phishing. As shown by perceptive hash (pHash) and radial hash (RADISH) algorithms, the middle part of the arrow is the most challenging because there are some differences between the phishing site and the real website where the simple detection algorithm cannot detect the phishing. Moreover, the user cannot notice that difference. The third part is not that critical because there are a lot of differences and as a result the user cannot be deceived.

Figure 2.

As real website changes, the users being cheated increase, hence detection algorithms cannot detect it.

Experiments

Before designing an algorithm to detect the phishing, we have to enclose all the possibility changes the phisher is trying to do to put his phishing website in the most challenging region. The phisher tries to make changes in the sit, wherefore the user cannot notice it as well as cheat the algorithm at the same time. This guides to do some experiments to find how the user memorizes the original website and which detail the user will ignore. If the phisher changes this detail, he will reach his goal.

The Benton Visual Retention Test (BVRT) is an individually administered test for people aged between 8 years and adulthood that measures visual perception and visual memory. It can also use to help identify possible learning disabilities among other afflictions that might affect an individual’s memory. The person examined had been shown 10 designs, one at a time, and had been asked to reproduce each one as accurately as possible on plain sheet from memory. The test is untimed and the results scored professionally by the properties of form, shape, pattern, and arrangement on the sheet. The number error score is calculated based on the number and type of errors made for each design. The major categories of these errors are rotations, misplacements, and the size errors as shown in Figure 3.¹⁵ From this test, we know all these changes can occur without the user attention. Wherefore, the hackers will try to do these changes to cheat the algorithm. Therefore, the first prosperity of our new algorithm is rotations, misplacements, and size errors carefree.

Figure 3.

The Benton Visual Retention Test card as present in Rowley and Baer.¹⁵

The number error score is calculated based on the number and type of errors made for each design. The major categories of these errors are rotations, misplacements, and size errors.¹⁵ From this test, we know all these changes can make without user attention. Wherefore, the hackers will try to do these changes to cheat the algorithm. Wherefore, the algorithm should be carefree about these changes. So, the first prosperity of our new algorithm is rotations, misplacements, and size errors carefree.

The second test is about color. The target of this test is to find how often the user can remember the colors of a website. It is composed of four forms that measure the examinee’s visual and memory abilities to remember the color. The test is a multiple-choice question. After the examinee sees a color, we give him a card containing four colors, and he has to choose the correct one. The sample is 40 people divided into four groups. Each group contains 10 people as depicted in Figure 4. Moreover, each group has certain time and conditions as provided in Table 2, and the score of this test is shown in Table 3. In these tables, we can see that the users can differentiate and remember the color if the fake color has big changes whether it occurs recently or long time ago. However, they can only remember 65% of the color if the hackers make small changes.

Figure 4.

Color test cards, each of them offered to one group.

Table 2.

Four groups of color test.

Group	Description	Total no. of views
A	There is a big difference in the choices of the colors	1 time in 1 day
B	There is a small difference in the choices of the colors	1 time in 1 day
C	There is a big difference in the choices of the colors	7 times in 1 week
D	There is a small difference in the choices of the colors	7 times in 1 week

Table 3.

The score of the four groups of color tests.

Group	True	False
A	10	0
B	6	4
C	10	0
D	7	3

Therefore, we can derive from that the algorithm should ignore the small changes in the color but not the big one.

Result of experiments

From the previous tests and positive phishing detection symptoms, we can design our optimal and new appropriate algorithm. Our design should include the following properties:

It must be carefree about rotations, misplacements, and size errors changes as depicted in Figure 5.

It should ignore the small changes in the colors but not the big one.

Figure 5.

Examples of different types of impersonation with rotations, misplacements, and size errors.

It has been estimated that humans can distinguish roughly 100 thousand different colors.¹⁶ Furthermore, the RGB model color in HTML can represent 17 million colors.¹⁷ As a result, we can make our new model to ignore 17 colors because there are 17 sets where the user cannot distinguish but the algorithm can. Moreover, only about 25% of the population is tetrachromat.¹⁸ Diana Derval performed an experiment to calculate that.

Figure 6 shows that there are 39 different colors. However, few people can distinguish between them and it gives us an indicator of how much a human can differentiate between colors.

Figure 6.

A test to measure how people can differentiate between colors.

Our model

Our new model has three main steps as shown in Figure 7. The direction of the first step is from the unknown website to the database. Furthermore, the direction of the second step is from the database to the unknown website. After that the algorithm can decide whether the unknown website is a phishing website or not, as depicted in Figure 7 where the dynamic chart starts from the “start” square.

Figure 7.

Three main steps of the algorithm.

As a prosperity step, we have a database that contains some information about the trusted websites as shown in Table 4 and Figure 8.

Table 4.

The prosperity step (a sample of the trusted websites database).

ID	Name	URL	Max color
1	Facebook	www.facebook.com	(23, 144, 22) (44, 122, 88)
2	Paypal	www.paypal.com	(22, 144, 22) (44, 122, 88)
3	Visa	www.visa.com	(22, 144, 22) (44, 122, 88)

Figure 8.

Prosperity step in our model.

After the prosperity step, we should have the names, URLs, and logos of each website stored in the database or the trusted websites array as the code.

Therefore, the first step is to extract the three max color occurrence as shown in Figure 9. And check them in the database; if it had found some similar website, it will return them as a set of a websites. Hence, there is a probability that the website is phished as shown in Figure 10.

Figure 9.

Extraction of the max color from the unknown website using the same extracting algorithm.

Figure 10.

Checking the max color in the database.

Therefore, we need more process to make sure whether those websites stored in the may-phish-me array are clean of phishing or not. Because of that we need the second step, which takes a logo of each site in the returned set from the database and check them in the unknown website. If a logo found and the URL is not similar, then the unknown website is a phishing website. Moreover, if any condition is not true, then the unknown website is not a phished website.

As Figure 11 describes, we use the most three max color occurrences in the website page to extract the max color. However, there are several choices like pHash or discrete cosine transform (DCT) hash algorithms that can be used. Actually, the DCT hash algorithm gives better accuracy because it is not affected with the small changes in the colors as depicted in Figure 12.

Figure 11.

Checking the logo and URL in the unknown website.

Figure 12.

DCT hash resistance five points.

Actually, we use five-point colors. However, we can use up to 17 points or more because not every human can detect the little difference in the color.

Figure 13 summarizes the three steps intended in our algorithm. We start our algorithm through registering a whitelist that contains all websites which we want to protect from phishing such as bank websites, social media websites, and governmental websites. At the end of our algorithm, we reach the judgment of whether the unknown website is phished or not.

Figure 13.

A summary of our model (the three-step algorithm).

The superiority of our model

We tested our algorithm in 600 phished websites and we had 94.99% accuracy. First, we collected a random sample from PhishTank project.¹⁹ The samples are 600 real phished websites. These samples are targeting 70 real websites. Through MATLAB using a machine vision toolbox to fetch the logos of the real website and the unknown website, we performed the testing.

As described in Table 5 and Figure 14, the Blacklist methods have the best accuracy, speed, and no phisher can cheat it by any of the phishers’ techniques, but it has a discomfiture issue. Blacklist algorithms depend on having victims. Those victims will report the blacklist system with the phished websites. The phished websites will be listed and stored to prevent having any more victims. However, the goal of the phisher is being achieved since he cracked one or two bank accounts. In the contrary, our algorithm has all blacklist algorithms’ advantages and no need to have victims. We designed our model to be fortified against most challenging area attacks. Moreover, our new model cannot be cheated by changing the extensible markup language (XML) code. For instance, if some phisher tries to change the XML code with another one to give the same interpreted page to cheat XML phishing detection algorithms, this will not deceive our algorithm because it takes a snapshot of the final interpreted result no matter what the XML code looks like. Our algorithm unlike whitelist algorithms compares each website listed in the whitelist with the unknown website. This high number of comparisons makes the whitelist model very slow and unusable at the end user point of view. However, our model also makes such comparisons, but utilizes the max-color hashing, and this makes the algorithm effective and has a complexity of log(n) efficacy.

Table 5.

Comparison between several algorithms.

Property/approach	Accuracy	Speed	Should have victims	Cheated by changing the xml	Cheated by changing colors	Cheated by shapes rotation	Cheated by size error changes	Most challenging area
Blacklist	Too high	Too high	Yes	No	No	No	No	No
Content xml, text	Mid	High	No	Yes	No	No	No	Yes
Pixel by pixel	Poor	Mid	No	No	No	No	No	Yes
Visual similarity	High	Mid	No	No	Yes	Yes	Yes	Yes
Whitelist	Too high	Too poor	No	No	No	No	No	No
Our new model	Too high	High	No	No	No	No	No	No

Figure 14.

Performance of our model, whitelists, and blacklists.

We propose to use our model as a plug-in with several Internet browsers to give the user a warning state if he enters a phishing website. It alarms the user when the website X is a phished website (X could be any trusted website). Certainly, before applying our new model, all the targeted websites (targeted websites are the websites that the phishers try to impersonate) should be retested in our database, and our database should be updated when there is a change in style or logos.

Conclusion and future work

This article presents the first step in a new approach of websites’ phishing detection using hyper vision techniques. We show results that defeat the vast majority of the current attack approaches. Nonetheless, we believe there are still more rooms to improve these outcomes. Max-color extraction improvement, Searching logos improvement, and Multi-logo website problems are our next future work to explore.

Footnotes

Academic Editor: Zubair M Fadlullah

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the Deanship of Scientific Research at King Saud University (no. RGP-1437-35).

References

Wenyin

Huang

Xiaoyu

. Detection of phishing web pages based on visual similarity. In: Proceedings of the fourteenth international world wide web conference, Chiba, Japan, 10–14 May 2005.

Medvet

Kirda

Kruegel

Visual-similarity-based phishing detection. In: SecureComm conference, Istanbul, Turkey, 22–25 September 2008.

Chen

Dick

Miller

Detecting visually similar web pages. ACM T Internet Techn 2010; 10: 25–30.

Gowtham

Krishnamurthi

Phishtackle–a web services architecture for anti-phishing. Cluster Comput 2014; 17(3): 1051–1068.

Afroz

Greenstadt

. Phishzoo: detecting phishing websites by looking at them. In: Proceedings of the fifth IEEE international conference in semantic computing (ICSC), Palo Alto, CA, 18–21 September 2011.

Park

Taylor

JM.

Using syntactic features for phishing detection. 2015, https://arxiv.org/ftp/arxiv/papers/1506/1506.00037.pdf

Nazif

Whittaker

Ryner

. Large-scale automatic classification of phishing pages. In: Proceedings of the 17th annual network and distributed system security symposium (NDSS’10), San Diego, CA, 28 February–3 March 2010.

APW Group. Global phishing survey: domain name use and trends. IEEE Commun Surv Tutor 2013; 15(4): 112–130.

Nguyen

LAT

Nguyen

BL.

An efficient approach based on neuro-fuzzy for phishing detection. J Autom Control Eng 2016; 4(2): 159–165.

10.

Kim

Huh

Detecting DNS-poisoning-based phishing attacks from their network performance characteristics. Electron Lett 2011; 47(11): 656–658.

11.

Hong

The state of phishing attacks. Commun ACM 2012; 55(1): 74–81.

12.

Rader

Rahman

SM.

Exploring historical and emerging phishing techniques and mitigating the associated security risks. Int J Netw Secur Appl 2013; 5(4): 23–41.

13.

Yang

Ding

A minimum enclosing ball-based support vector machine approach for detection of phishing websites. Optik: Int J Light Electron Opt 2016; 127(1): 345–351.

14.

Caputo

Pfleeger

Freeman

. Going spear phishing: exploring embedded training and awareness. IEEE Signal Proc Mag 2014; 12(1): 28–38.

15.

Rowley

Baer

PE.

Visual retention test Performance in emotionally disturbed and brain-damaged children. Am J Orthopsychiat 2010; 31(3): 579–583.

16.

Calkins

DJ.

Mapping color perception to a physiological substrate (The visual neurosciences, vol. 1). Cambridge, MA: The MIT Press, 2014.

17.

Osman

Hitam

Ismail

MN.

Enhanced skin colour classifier using RGB ratio model. Int J Soft Comput (IJSC) 2012; 3(4): 54–60.

18.

Derval

The right sensory mix: targeting consumer product development scientifically. Berlin, Heidelberg: Springer, 2013.

19.

Johnson

ME.

Managing information risk and the economics of security. Berlin, Heidelberg: Springer, 2009.