Yagya R Panddeya

Citizenship: Nepal

Permanent Address: Sukhlaphata-10, Kanchanpur, Nepal

Mailing Address: Kathmandu University, Kavre, Nepal

Date of birth: 25th July, 1988

Phone: (+977)9848542617, (+977)9701002605

yagya.pandeya@ku.edu.np

yagyapandeya@gmail.com

Carrier Objectives

An enthusiastic and adaptive person with a broad and acute interest in the discovery of deep learning and AI. I particularly enjoy collaborating with tech experts from different disciplines of computer science to develop new skills and solve new challenges.

Bio

I am an assistant professor and acting head of department (HoD), Department of Artificial Intelligence at Kathmandu University, and also affiliated to the Guru technology research group in Nepal. I have a good experience of deep learning and machine learning technologies for image, audio, music, video and text processing.

My primary research interests focus on music processing, multimodal information retrieval, and the application of AI in agriculture, tourism, culture preservation, and 3D modeling using deep neural networks.

Educations

Jeonbuk National University

Major in Machine Learning and Deep Neural Networks

CGPA: 4.43/4.50

Sep. 2017 - Feb. 2021

Pokhara University

Nepal College of Information Technology, Balkumari, Lalitpur(PU Affiliated College)

Rank: Dean's List (CGPA: 3.97/4)

Aug. 2010 to May 2013

Pokhara University(PU)

National Academy of Science and Technology, Dhangadhi, Kailali(PU Affiliated College)

Rank: CGPA: 3.54/4

Aug. 2005 to Oct. 2010

National Examination Board(NEB)

Radiant Secondary School, Mahendranagar, Kanchanpur(Under NEB)(Class XII)

OVERALL PERCENTAGE: 55%

2005

National Examination Board(NEB)

Shree Radha-Krishna Secondary School, Tiltali, Doti(Under Government of Nepal)(Class X)

OVERALL PERCENTAGE: 74%

2003

WORK EXPERIENCES

Kathmandu University (AI Department, Head)

AI based teaching and research

Address: Kavrepalanchok, Nepal

Status: Assistant Professor

01 March 2022 - Ongoing

Guru Technology

Deep learning based research and advisor.

Address: Kathmandu, Nepal

Status: Director of Research & Technology

01 March 2018 - Ongoing

Government of Nepal, Ministry of Communication and Information Technology

National Artificial Intelligence (AI) Policy Development.

Address: Kathmandu, Nepal

Status: Member

22 July 2024 - Ongoing

International Journal of Information Communication Technology and Digital Convergence (IJICTDC)

Overseeing the editorial process, maintaining journal quality, representing the journal to the academic community, addressing ethical considerations, and managing the editorial team..

Address: Seoul, South Korea

Status: Editor in Chief

13 Aug. 2024 - Ongoing

Fuzzy Logic and Artificial Intelligence Laboratory at Jeonbuk National University

Machine learning and Deep learning based research

Address: Jeonju City, South Korea

Status: Researcher

01 March 2021 - 28 Feb. 2022

Ministry of Home Affairs (Government of Nepal)

National Information Collection and Transfer

Address: Singhdurbar, Kathmandu, Nepal

Status: IT officer

27 April 2015 - Aug 22 2016

NAST Engineering College

Assistant Professor and Department head.

Address: Dhangadhi, Kailali, Nepal

Status: Head of Department

12 Feb 2013 - 27 April 2015

PROJECTS

Music Rhythm Segmentation

Jan. 2022 - June 2022

Article title:Tracking the Rhythm: Pansori Rhythm Segmentation and Classification Methods and Datasets.[SCIE Journal]
Music rhythm classification and segmetation dataset.
Korean traditional music (Pansori) dataset.
Classification and semantic segmentation methods for music rhythm segmentation.
HRnet, Novel optimized network and DeepLabV3 network architectures.
Puplished on Applied Sciences (MDPI) in Sep., 2022

Music Source Separation

Oct. 2021 - Feb. 2022

Article title:High-Resolution Representation Learning and Recurrent Neural Network for Singing Voice Separation.[SCIE Journal]
Music source separation using novel dataset.
Modified HRnet and ablation study.
Comparation on public dataset and state of art result.
Puplished on Circuits, Systems, and Signal Processing (Springer) in Sep., 2022

Music genre calssification

Dec. 2020 - Sep. 2021

Article title:Multi-modal, Multi-task and Multi-label for Music Genre Classification and Emotion Regression.[IEEE Conference]
Multimodal, multi-task and multi-label DNN.
Optimized neural network.
44 Class categores of music genra.
Puplished on ICTC2021 in Dec. 2021

Plant Disease Classification

Oct. 2020 - March 2021

Article title:An Incremental Learning for Plant Disease classification.[IEEE Conference]
Life-long learning method in DNN.
ResNet18 and ResNet50 neural network.
High performance.
Puplished on ICTC2021 in Dec. 2021

Music Video Affective Computing (Unsupervised)

Sep. 2020 - Jan. 2021

Article title:Music video Emotion Classification using Slow-fast Audio-video Network and Unsupervised Feature Representation.[SCIE Journal]
Unsupervised and supervised music video emotion classification dataset
Autoencoder architecture with audio adn video information.
Slow-fast audio-video network to capture spatial and temporal information of music and video.
Train time information sharing and boosting modules.
Puplished on Scientific reports (nature) in Oct., 2021

Music Video Affective Computing (Supervised)

Sep. 2020 - Jan. 2021

Article title:Deep-Learning-Based Multimodal Emotion Classification for Music Videos.[SCIE Journal]
Music video emotion classification dataset (Inproved and Extended version)
Ablation study on unimodla and multimodal using music, video and facial expression.
Network complexity reduction using novel channel and filter separable convolution.
Train time information sharing and boosting modules.
End-to-end training, better result on visual and statistical analysis.
Puplished on Sensors in July, 2021

Facemask States Detection

Nov. 2020 - Jan. 2021

Article title:Deep Learning Based Face Mask Status Detection for COVID-19.[Conference]
Semi-automatic visual object labeling tool
Facemask detection dataset with three cass categories of with mask, without mask and wrong weared mask.
Mask detetion using Faster-RCNN, Cascade FRCNN, FPN and Cascade FPN.
Comparision, visualization and analysis of sustem ability and applications.
Puplished on ICMLT Conferencein April 2021

Sound Event Labeling Tool

Dec. 2019 - Sep. 2020

Article title:A Semi-automatic Sound Annotation Tool for Audio/Video data.[SCIE Journal]
Semi-automatic sound event annotation tool using audio and video as input.
Automatic event detector is used to detect the audio event.
Based on the automatic detector result, an human annotation have to refine the annotation boundary.
Easy to use, better audio visualization, python based and output in easy CSV data file.
Diversified annotation tool for any rare sound event.
Puplished on Livestock Sciencein Feb. 2022

Music Source Seperation

June 2020 - Nov. 2020

Article title:Parallel Stacked Hourglass Network for Music Source Separation.[SCIE Journal]
Prepared Korean traditional song (Pansori) dataset with 3 sources.

Korean traditional music Pansori dataset, MIR-1K dataset, and DSD100 dataset used in experiment.

Proposed a novel parallel stacked hourglass network (PSHN) with multiple band spectrograms.

Ablation study on proposed and past architecture.

State-of-art result.

Puplished on IEEE Access in Nov. 2020

CNN Based Sound Event Detection in Cowshed

Dec. 2019 - Sep. 2020

Article title:Sound Event Detection in Cowshed using Synthetic data and Convolutional Neural Network[IEEE Conference]

CNN based sound event detection.

Sound event annotaion tool.

Sound localization and classification.

Puplished on ICTC2020 in Sep. 2020

Cow Sound Event Localization and Classification

Dec. 2019 - Sep. 2020

Article title:Visual Object Detector for Cow Sound Event Detection[SCIE Journal]

Cow sound event detection dataset with 4 class categories.

CNN used for sound event detection using Cow sound dataset and UrbanSound8K dataset.

Visual object detection architecture (F-RCNN, CF-RCNN, FPN, C-FPC) used for audio event detection (in Log Mel-Spectrogram).

Compare the proposed CNN and Visual object detection architecture using three test dataset.

Puplished on IEEE Access in Sep. 2020

Music-Video Emotion Classification

Jan. 2019 - Sep. 2019

Article title:Deep Learning-Based Late Fusion of Multimodal Information for Emotion Classification of Music Video[SCIE Journal]

Music-Video emotion classification using audio and video multimodal network architecture.

Use pretrained CNN for audio and 3D video model (I3D and C3D).

The network learned features ate late fused and compare the impact of network feature fusion.

Cross validation and network feature fusion.

Puplished on Multimedia Tools and Applications in Sep. 2020

Music Video Emotion Analysis

Dec. 2018 - March 2019

Article title:Music-Video Emotion Analysis Using Late Fusion of Multimodal[Conference]

Music video emotion dataset of six class category.

Audio-video multimodal architecture.

C3D pretrained network and CNN pretrained audio network feature fusion.

Emotion representation in 2D emotion space.

Puplished on ITEEE 2019 Conference in 2019

Domestic Cat Sound Classification

Dec. 2017 - Sep. 2018

Article title:Domestic Cat Sound Classification Using Learned Features from Deep Neural Nets[SCI Journal]

CNN and CDBN network architecture.

Cat sound dataset preparation of 10 class categories.

Frequency division average pooling (FDAP) technique instead of global average pooling (GAP) to make a robust prediction using various frequency band features.

Audio augmentation and learned feature visualization.

Puplished on Applied Science in Sep 2018

Domestic Cat Sound Classification using Transfer Learning

Dec. 2017 - March 2018

Article title:Domestic Cat Sound Classification Using Transfer Learning[SCIE Journal]

Cat sound dataset with 10 class categories.

Use pretrained CNN for feature extraction and make feature classification.

Machine learning classifier and deep learning classifier comparision.

Ensemble and data augmentation.

Puplished on International Journal of Fuzzy Logic and Intelligent Systems in June 2018

Current Research

Multi-culture emotion analysis.

Vegetable disease detection and remedy system.

Remote sensing and GIS image analysis for agriculture field monitoring.

Wildlife activity monitoring.

Authentication system for vahicle.

Books Publications

Yagya Raj Pandeya Multimedia Information Processing using Deep Learning. (Under publication)

Yagya Raj Pandeya and Sharad Chandra Joshi, An Essential Guide to Computer Networks. (2006, in Nepal)

Achievements

President's Excellent Research Award ofJeonbuk National University (2021-02)

Winner Korean Government Scholarship Program (KGSP) (2016)

Awarded by Dean's List of Pokhara University in 2014

Language Skills

English Language

Korean Language

Hindi

Nepali

Good

Moderate(TOPIK-3)

Native

Native

Techincal Skills

Programming Languages

Deep learning Framework

Platforms

I.D.E Skills

Python, C, C++, PHP

TensorFlow, Keras, PyTorch

Linux, Windows, CUDA/Docker

Eclipse, UML, PyCharm

References

Prof. Joonwhoan Lee

Ph.D. Adviser

Institude: Jeonbuk National University

Ph No.: +82-63-270-2406, +82-010-9855-2406

Email: chlee@chonbuk.ac.kr

Prof. Shashidhar Ram Joshi

Master Adviser

Institude: Pokhara University

Ph No.: +977-01-5534070

Email: srjoshi@ioe.edu.np

Dr. Prem Bahadur Chand

Undergraduate Instructor and Co-worker

Institude: Dhangadhi Engineeing College (Pokhara University Affiliation)

Ph No.: +977-9858487111, 9848424687

Email: prem.chand@nast.edu.np