ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1708.05237
  4. Cited By
S$^3$FD: Single Shot Scale-invariant Face Detector
v1v2v3 (latest)

S3^33FD: Single Shot Scale-invariant Face Detector

17 August 2017
Shifeng Zhang
Xiangyu Zhu
Zhen Lei
Hailin Shi
Xiaobo Wang
Stan Z. Li
    CVBM
ArXiv (abs)PDFHTML

Papers citing "S$^3$FD: Single Shot Scale-invariant Face Detector"

50 / 227 papers shown
Title
FAME: A Lightweight Spatio-Temporal Network for Model Attribution of Face-Swap Deepfakes
FAME: A Lightweight Spatio-Temporal Network for Model Attribution of Face-Swap Deepfakes
Wasim Ahmad
Yan-Tsung Peng
Yuan-Hao Chang
CVBM
81
0
0
13 Jun 2025
Lightweight Joint Audio-Visual Deepfake Detection via Single-Stream Multi-Modal Learning Framework
Lightweight Joint Audio-Visual Deepfake Detection via Single-Stream Multi-Modal Learning Framework
Kuiyuan Zhang
Wenjie Pei
Rushi Lan
Yifang Guo
Zhongyun Hua
15
0
0
09 Jun 2025
ViCocktail: Automated Multi-Modal Data Collection for Vietnamese Audio-Visual Speech Recognition
Thai-Binh Nguyen
T. Nguyen
Quoc Truong Do
Chi Mai Luong
96
0
0
05 Jun 2025
Analyzing Character Representation in Media Content using Multimodal Foundation Model: Effectiveness and Trust
Analyzing Character Representation in Media Content using Multimodal Foundation Model: Effectiveness and Trust
Evdoxia Taka
Debadyuti Bhattacharya
Joanne Garde-Hansen
Sanjay Sharma
Tanaya Guha
10
0
0
02 Jun 2025
Multimodal Assessment of Speech Impairment in ALS Using Audio-Visual and Machine Learning Approaches
Multimodal Assessment of Speech Impairment in ALS Using Audio-Visual and Machine Learning Approaches
Francesco Pierotti
Andrea Bandini
18
0
0
27 May 2025
MIKU-PAL: An Automated and Standardized Multi-Modal Method for Speech Paralinguistic and Affect Labeling
MIKU-PAL: An Automated and Standardized Multi-Modal Method for Speech Paralinguistic and Affect Labeling
Cheng Yifan
Zhang Ruoyi
Shi Jiatong
59
0
0
21 May 2025
Learning to Borrow Features for Improved Detection of Small Objects in Single-Shot Detectors
Learning to Borrow Features for Improved Detection of Small Objects in Single-Shot Detectors
Richard Schmit
ObjD
179
0
0
30 Apr 2025
Generative Fields: Uncovering Hierarchical Feature Control for StyleGAN via Inverted Receptive Fields
Generative Fields: Uncovering Hierarchical Feature Control for StyleGAN via Inverted Receptive Fields
Zhuo He
Paul Henderson
Nicolas Pugeault
GAN
88
0
0
24 Apr 2025
Towards Realistic Low-Light Image Enhancement via ISP Driven Data Modeling
Towards Realistic Low-Light Image Enhancement via ISP Driven Data Modeling
Zhihua Wang
Yu Long
Qinghua Lin
Kai Zhang
Yize Zhang
Yuming Fang
Li Liu
Xiaochun Cao
94
0
0
16 Apr 2025
Archival Faces: Detection of Faces in Digitized Historical Documents
Archival Faces: Detection of Faces in Digitized Historical Documents
Marek Vaško
Adam Herout
Michal Hradiš
CVBM
132
0
0
01 Apr 2025
Prosody-Enhanced Acoustic Pre-training and Acoustic-Disentangled Prosody Adapting for Movie Dubbing
Prosody-Enhanced Acoustic Pre-training and Acoustic-Disentangled Prosody Adapting for Movie Dubbing
Zhedong Zhang
Liang-Sheng Li
C. Yan
Chunshan Liu
Anton Van Den Hengel
Yuankai Qi
142
2
0
15 Mar 2025
PromptLNet: Region-Adaptive Aesthetic Enhancement via Prompt Guidance in Low-Light Enhancement Net
Jun Yin
Yangfan He
Miao Zhang
Pengyu Zeng
Tianyi Wang
Shuai Lu
Xueqian Wang
DiffM
145
7
0
11 Mar 2025
B-FPGM: Lightweight Face Detection via Bayesian-Optimized Soft FPGM Pruning
B-FPGM: Lightweight Face Detection via Bayesian-Optimized Soft FPGM Pruning
Nikolaos Kaparinos
Vasileios Mezaris
CVBM
162
0
0
28 Jan 2025
Friends-MMC: A Dataset for Multi-modal Multi-party Conversation
  Understanding
Friends-MMC: A Dataset for Multi-modal Multi-party Conversation Understanding
Yueqian Wang
Xiaojun Meng
Yijiao Wang
Jianxin Liang
Qun Liu
Dongyan Zhao
89
1
0
23 Dec 2024
Enhancing Remote Adversarial Patch Attacks on Face Detectors with Tiling
  and Scaling
Enhancing Remote Adversarial Patch Attacks on Face Detectors with Tiling and Scaling
Masora Okano
Koichi Ito
M. Nishigaki
Tetsushi Ohki
CVBMAAML
82
0
0
11 Dec 2024
MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation
MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation
Weijia Wu
Mingyu Liu
Zeyu Zhu
Xi Xia
Haoen Feng
Wen Wang
Kevin Qinghong Lin
Chunhua Shen
Mike Zheng Shou
DiffMVGen
230
3
0
22 Nov 2024
Lipschitz-Driven Noise Robustness in VQ-AE for High-Frequency Texture Repair in ID-Specific Talking Heads
Lipschitz-Driven Noise Robustness in VQ-AE for High-Frequency Texture Repair in ID-Specific Talking Heads
Jian Yang
Xukun Wang
Wentao Wang
Guoming Li
Qihang Fang
Ruihong Yuan
Tianyang Wang
Jason Zhaoxin Fan
Yeying Jin
Zhaoxin Fan
VGen
156
1
0
01 Oct 2024
Upper-Body Pose-based Gaze Estimation for Privacy-Preserving 3D Gaze
  Target Detection
Upper-Body Pose-based Gaze Estimation for Privacy-Preserving 3D Gaze Target Detection
Andrea Toaiari
Vittorio Murino
Marco Cristani
Cigdem Beyan
123
1
0
26 Sep 2024
Seeing Faces in Things: A Model and Dataset for Pareidolia
Seeing Faces in Things: A Model and Dataset for Pareidolia
Mark Hamilton
Simon Stent
Vasha Dutell
Anne Harrington
Jennifer Corbett
R. Rosenholtz
William T. Freeman
CVBM
52
1
0
24 Sep 2024
Human-Centric Transformer for Domain Adaptive Action Recognition
Human-Centric Transformer for Domain Adaptive Action Recognition
Kun-Yu Lin
Jiaming Zhou
Wei-Shi Zheng
97
7
0
15 Jul 2024
Similarity Distance-Based Label Assignment for Tiny Object Detection
Similarity Distance-Based Label Assignment for Tiny Object Detection
Shuohao Shi
Qiang Fang
Tong Zhao
Xin Xu
ObjD
116
4
0
02 Jul 2024
FDLite: A Single Stage Lightweight Face Detector Network
FDLite: A Single Stage Lightweight Face Detector Network
Yogesh Aggarwal
Prithwijit Guha
ObjDCVBM
90
3
0
27 Jun 2024
DubWise: Video-Guided Speech Duration Control in Multimodal LLM-based
  Text-to-Speech for Dubbing
DubWise: Video-Guided Speech Duration Control in Multimodal LLM-based Text-to-Speech for Dubbing
Neha Sahipjohn
Ashishkumar Gudmalwar
Nirmesh Shah
Pankaj Wasnik
R. Shah
111
7
0
13 Jun 2024
Better Sampling, towards Better End-to-end Small Object Detection
Better Sampling, towards Better End-to-end Small Object Detection
Zile Huang
Chong Zhang
Mingyu Jin
Fangyu Wu
Chengzhi Liu
Xiaobo Jin
ObjD
117
1
0
17 May 2024
Unified Dynamic Scanpath Predictors Outperform Individually Trained Neural Models
Unified Dynamic Scanpath Predictors Outperform Individually Trained Neural Models
Fares Abawi
Di Fu
Stefan Wermter
102
0
0
05 May 2024
3D Face Tracking from 2D Video through Iterative Dense UV to Image Flow
3D Face Tracking from 2D Video through Iterative Dense UV to Image Flow
Felix Taubner
Prashant Raina
Mathieu Tuli
Eu Wern Teh
Chul Lee
Jinmiao Huang
3DHCVBM
85
4
0
15 Apr 2024
Tiny Machine Learning: Progress and Futures
Tiny Machine Learning: Progress and Futures
Ji Lin
Ligeng Zhu
Wei-Ming Chen
Wei-Chen Wang
Song Han
90
60
0
28 Mar 2024
AVT2-DWF: Improving Deepfake Detection with Audio-Visual Fusion and
  Dynamic Weighting Strategies
AVT2-DWF: Improving Deepfake Detection with Audio-Visual Fusion and Dynamic Weighting Strategies
Rui Wang
Dengpan Ye
Long Tang
Yunming Zhang
Yueyun Shang
ViT
73
9
0
22 Mar 2024
MIntRec2.0: A Large-scale Benchmark Dataset for Multimodal Intent
  Recognition and Out-of-scope Detection in Conversations
MIntRec2.0: A Large-scale Benchmark Dataset for Multimodal Intent Recognition and Out-of-scope Detection in Conversations
Hanlei Zhang
Xin Wang
Hua Xu
Qianrui Zhou
Kai Gao
Jianhua Su
jinyue Zhao
Wenrui Li
Yanting Chen
151
5
0
16 Mar 2024
SecurePose: Automated Face Blurring and Human Movement Kinematics
  Extraction from Videos Recorded in Clinical Settings
SecurePose: Automated Face Blurring and Human Movement Kinematics Extraction from Videos Recorded in Clinical Settings
Rishabh Bajpai
Bhooma R. Aravamuthan
CVBM
78
1
0
21 Feb 2024
StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing
StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing
Gaoxiang Cong
Yuankai Qi
Liang-Sheng Li
Amin Beheshti
Zhedong Zhang
Anton Van Den Hengel
Ming-Hsuan Yang
Chenggang Yan
Qingming Huang
115
14
0
20 Feb 2024
Small Object Tracking in LiDAR Point Cloud: Learning the
  Target-awareness Prototype and Fine-grained Search Region
Small Object Tracking in LiDAR Point Cloud: Learning the Target-awareness Prototype and Fine-grained Search Region
Shengjing Tian
Yinan Han
Xiuping Liu
Xiantong Zhao
77
0
0
24 Jan 2024
Leveraging Visual Supervision for Array-based Active Speaker Detection
  and Localization
Leveraging Visual Supervision for Array-based Active Speaker Detection and Localization
Davide Berghi
Philip J. B. Jackson
66
5
0
21 Dec 2023
SMILE: Multimodal Dataset for Understanding Laughter in Video with
  Language Models
SMILE: Multimodal Dataset for Understanding Laughter in Video with Language Models
Lee Hyun
Kim Sung-Bin
Seungju Han
Youngjae Yu
Tae-Hyun Oh
100
15
0
15 Dec 2023
DiT-Head: High-Resolution Talking Head Synthesis using Diffusion
  Transformers
DiT-Head: High-Resolution Talking Head Synthesis using Diffusion Transformers
Aaron Mir
Eduardo Alonso
Esther Mondragón
DiffM
93
2
0
11 Dec 2023
Boosting Object Detection with Zero-Shot Day-Night Domain Adaptation
Boosting Object Detection with Zero-Shot Day-Night Domain Adaptation
Zhipeng Du
Miaojing Shi
Jiankang Deng
ObjD
101
10
0
02 Dec 2023
Filter-Pruning of Lightweight Face Detectors Using a Geometric Median
  Criterion
Filter-Pruning of Lightweight Face Detectors Using a Geometric Median Criterion
Konstantinos Gkrispanis
Nikolaos Gkalelis
Vasileios Mezaris
VLMCVBM
79
4
0
28 Nov 2023
Multi-Modal Gaze Following in Conversational Scenarios
Multi-Modal Gaze Following in Conversational Scenarios
Yuqi Hou
Zhongqun Zhang
Nora Horanyi
Jaewon Moon
Yihua Cheng
Hyung Jin Chang
75
5
0
09 Nov 2023
CapST: Leveraging Capsule Networks and Temporal Attention for Accurate Model Attribution in Deep-fake Videos
Wasim Ahmad
Yan-Tsung Peng
Yuan-Hao Chang
Gaddisa Olani Ganfure
Sarwar Khan
62
1
0
07 Nov 2023
Global Structure-Aware Diffusion Process for Low-Light Image Enhancement
Global Structure-Aware Diffusion Process for Low-Light Image Enhancement
Jinhui Hou
Zhiyu Zhu
Junhui Hou
Hui Liu
Huanqiang Zeng
Hui Yuan
146
89
0
26 Oct 2023
The Importance of Anti-Aliasing in Tiny Object Detection
The Importance of Anti-Aliasing in Tiny Object Detection
Jinlai Ning
Michael W. Spratling
91
4
0
22 Oct 2023
Audio-visual child-adult speaker classification in dyadic interactions
Audio-visual child-adult speaker classification in dyadic interactions
Anfeng Xu
Kevin Huang
Tiantian Feng
Helen Tager-Flusberg
Shrikanth Narayanan
56
3
0
03 Oct 2023
How Robust is Google's Bard to Adversarial Image Attacks?
How Robust is Google's Bard to Adversarial Image Attacks?
Yinpeng Dong
Huanran Chen
Jiawei Chen
Zhengwei Fang
Xiaohu Yang
Yichi Zhang
Yu Tian
Hang Su
Jun Zhu
AAML
118
116
0
21 Sep 2023
Trash to Treasure: Low-Light Object Detection via
  Decomposition-and-Aggregation
Trash to Treasure: Low-Light Object Detection via Decomposition-and-Aggregation
Xiaohan Cui
Long Ma
Tengyu Ma
Jinyuan Liu
Xin-Yue Fan
Risheng Liu
ViT
98
11
0
07 Sep 2023
Holistic Dynamic Frequency Transformer for Image Fusion and Exposure
  Correction
Holistic Dynamic Frequency Transformer for Image Fusion and Exposure Correction
Xiaoke Shang
Gehui Li
Zhiying Jiang
Shaomin Zhang
Nai Ding
Jinyuan Liu
68
18
0
03 Sep 2023
Small Object Detection via Coarse-to-fine Proposal Generation and
  Imitation Learning
Small Object Detection via Coarse-to-fine Proposal Generation and Imitation Learning
Xiang Yuan
Gong Cheng
Ke Yan
Qinghua Zeng
Junwei Han
ObjD
84
58
0
18 Aug 2023
Audio-visual video-to-speech synthesis with synthesized input audio
Audio-visual video-to-speech synthesis with synthesized input audio
Triantafyllos Kefalas
Yannis Panagakis
Maja Pantic
VGenDiffM
100
1
0
31 Jul 2023
A Unified Framework for Modality-Agnostic Deepfakes Detection
A Unified Framework for Modality-Agnostic Deepfakes Detection
Cai Yu
Peng-Wen Chen
Jiahe Tian
Jin Liu
Jiao Dai
Xi Wang
Yesheng Chai
Shan Jia
Siwei Lyu
Jizhong Han
77
0
0
26 Jul 2023
Householder Projector for Unsupervised Latent Semantics Discovery
Householder Projector for Unsupervised Latent Semantics Discovery
Yue Song
Jichao Zhang
N. Sebe
Wei Wang
100
5
0
16 Jul 2023
FTFDNet: Learning to Detect Talking Face Video Manipulation with
  Tri-Modality Interaction
FTFDNet: Learning to Detect Talking Face Video Manipulation with Tri-Modality Interaction
Gang Wang
Peng Zhang
Jun Xiong
Fei Yang
Wei Huang
Yufei Zha
CVBM
71
1
0
08 Jul 2023
12345
Next