Open Access Open Access  Restricted Access Subscription or Fee Access

Categorization of Video using Viola Jones and Fisher’s Linear Discriminant Function

Shoba Rani, D. Syed  Ali


A large amount of single-shot, short videos are created by using personal camcorder in day-to-day life. Many videos are kept in a video pool and merged into a single video. Categorization is done based on transition clues like objects or human beings. For categorization process frame-by-frame search is made on videos in a video pool. Frames are extracted from Video using Viola Jones algorithm. In each frame, complete object is extracted.  Features are extracted from the remaining portion of object using Fisher’s Linear Discriminant function. The features are extracted from objects is considered as a pattern. If 20 frames belonging to a video are considered, then 20 patterns are created. This proposed system is mainly used for separating human beings and objects.


Categorization, Integral Image, HAAR Detector, Viola Jones Method, Fisher’s Linear Discriminant Function

Full Text:



“Robust Multi-Pose Face Detection in Images” by Rong Xiao, Ming-Jing Li, Hong-Jiang Zhang

“Efficient Eyes and Mouth Detection Algorithm using Combination of Viola Jones and Skin Color Pixel Detection” by Ijaz Khan, Hadi Abdullah and Mohd Shamian Bin Zainal(2013)

“Accelerating Viola-Jones Face Detection to FPGA-Level using GPUs” by Daniel Hefenbrock, Jason Oberg, Nhat Tan Nguyen Thanh, Ryan Kastner, Scott B. Baden

“Learning a Dynamic Classification Method to Detect Faces and Identify Facial Expression”

Ramana Isukapalli1, Russell Greiner2, and Ahmed Elgammal3

“Image Compression Using Modified Fast Haar Wavelet Transform” by Anuj Bhardwaj and Rashid Ali,

“Highly Scalable, Parallel and Distributed Adaboost Algorithm using Light Weight Threads and Web Services on A Network of Multi-Core Machines” by Munther Abualkibash, Ahmed Elsayed, Ausif Mahmood(2013)

Pantic M., and Rothkrantz L.J.M., 2000, Expert system for Automatic Analysis of facial Expressions, ELSEVIER, Image and Vision Computing, Vol.18, pp.881-905

Viola P., and Jones M., 2001, Robust real-time object detection, Technical Report 2001/01, Compaq Cambridge Research Lab

Essa I.A., and. Pentland A.P., 1997, Coding, analysis, interpretation and recognition of facial expressions, IEEE Transaction on Pattern Analysis and Machine Intelligence, Vol.19, No.7, pp.757- 763.

Turk M., and Pentland A., 1991, Eigenfaces for recognition. Journal of Cognitive Neuroscience, Vol.3, No.1, pp.71-86.

Steffens J., Elagin E., and Neven H., 1998, Person spotter fast and robust system for human detection, Tracking, and recognition, In Proceedings International Conference on Automatic Face and Gesture Recognition, pp.516-521.

Littlewort G., Fasel I., Stewart M., Bartlett and Movellan J. R., 2002, fully automatic coding of basic expressions from video, Technical Report, UCSD INC MP Lab.

Boccignone G., Caggiano V., Fiore G. D., and Marcelli A., 2005, Probabilistic detection and tracking of faces in video, In Proceedings of International Conference on Image Analysis and Processing, pp.687-694

Arnaud E., Fauvet B., Memin E., and Bouthemy P., 2005, A robust and automatic face tracker dedicated to broadcast videos, In Proceeding Of International Conference on Image Processing, pp.429-432

Li Y., Ai H., Huang C., and Lao S., 2006, Robust head tracking with particles based on multiple cues fusion, ECCV Workshop on HCI, pp.29-39

Lades M., Vorbruggen J.C., Buhmann J., Lange J., vonder Malsburg C., Wurtz R.P., and Konen W., 1993, Distortion invariant object recognition in the dynamic link architecture, IEEE Transactions on Computers, Vol.42, pp.300-311.


  • There are currently no refbacks.

Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.