Ali Asghar Farhadi
#88,845
Most Influential Person Now
Ali Asghar Farhadi's AcademicInfluence.com Rankings
Ali Asghar Farhadicomputer-science Degrees
Computer Science
#2996
World Rank
#3140
Historical Rank
Artificial Intelligence
#435
World Rank
#442
Historical Rank
Database
#551
World Rank
#578
Historical Rank

Download Badge
Computer Science
Ali Asghar Farhadi's Degrees
- Bachelors Computer Engineering Sharif University of Technology
- Masters Computer Engineering Sharif University of Technology
- PhD Computer Science Sharif University of Technology
Similar Degrees You Can Earn
Why Is Ali Asghar Farhadi Influential?
(Suggest an Edit or Addition)Ali Asghar Farhadi's Published Works
Number of citations in a given year to any of this author's works
Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author
Published Works
- You Only Look Once: Unified, Real-Time Object Detection (2015) (22888)
- YOLOv3: An Incremental Improvement (2018) (12675)
- YOLO9000: Better, Faster, Stronger (2016) (10678)
- XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks (2016) (3206)
- Describing objects by their attributes (2009) (1958)
- Bidirectional Attention Flow for Machine Comprehension (2016) (1913)
- Unsupervised Deep Embedding for Clustering Analysis (2015) (1908)
- Target-driven visual navigation in indoor scenes using deep reinforcement learning (2016) (1175)
- Every Picture Tells a Story: Generating Sentences from Images (2010) (1099)
- Hollywood in Homes: Crowdsourcing Data Collection for Activity Understanding (2016) (858)
- YOLOv 3 : An Incremental Improvement (2018) (789)
- AI2-THOR: An Interactive 3D Environment for Visual AI (2017) (615)
- Defending Against Neural Fake News (2019) (549)
- From Recognition to Cognition: Visual Commonsense Reasoning (2018) (517)
- Recognition using visual phrases (2011) (455)
- Understanding egocentric activities (2011) (388)
- Deep3D: Fully Automatic 2D-to-3D Video Conversion with Deep Convolutional Neural Networks (2016) (375)
- HellaSwag: Can a Machine Really Finish Your Sentence? (2019) (355)
- Fine-Tuning Pretrained Language Models: Weight Initializations, Data Orders, and Early Stopping (2020) (328)
- IQA: Visual Question Answering in Interactive Environments (2017) (307)
- Learning Everything about Anything: Webly-Supervised Visual Concept Learning (2014) (306)
- OK-VQA: A Visual Question Answering Benchmark Requiring External Knowledge (2019) (239)
- Attribute-centric recognition for cross-category generalization (2010) (218)
- Visual Semantic Navigation using Scene Priors (2018) (216)
- Actions ~ Transformations (2015) (215)
- Learning to Recognize Activities from the Wrong View Point (2008) (215)
- What’s Hidden in a Randomly Weighted Neural Network? (2019) (208)
- Situation Recognition: Visual Semantic Role Labeling for Image Understanding (2016) (203)
- Attribute Discovery via Predictable Discriminative Binary Codes (2012) (201)
- Are You Smarter Than a Sixth Grader? Textbook Question Answering for Multimodal Machine Comprehension (2017) (179)
- Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time (2022) (174)
- Robust fine-tuning of zero-shot models (2021) (166)
- Asynchronous Temporal Fields for Action Recognition (2016) (159)
- Label Refinery: Improving ImageNet Classification through Label Progression (2018) (157)
- Ranking Domain-Specific Highlights by Analyzing Edited Videos (2014) (142)
- Newtonian Image Understanding: Unfolding the Dynamics of Objects in Static Images (2015) (141)
- Learning to Learn How to Learn: Self-Adaptive Visual Navigation Using Meta-Learning (2018) (136)
- MERLOT: Multimodal Neural Script Knowledge Models (2021) (134)
- Soft Threshold Weight Reparameterization for Learnable Sparsity (2020) (132)
- Supermasks in Superposition (2020) (131)
- Visual Semantic Planning Using Deep Successor Representations (2017) (126)
- VisKE: Visual knowledge extraction and question answering by visual verification of relation phrases (2015) (121)
- FigureSeer: Parsing Result-Figures in Research Papers (2016) (121)
- Transfer Learning in Sign language (2007) (120)
- "What Happens If..." Learning to Predict the Effect of Forces in Images (2016) (115)
- Predicting Failures of Vision Systems (2014) (113)
- Real-Time Open-Domain Question Answering with Dense-Sparse Phrase Index (2019) (111)
- Re$^3$: Re al-Time Recurrent Regression Networks for Visual Tracking of Generic Objects (2017) (109)
- RoboTHOR: An Open Simulation-to-Real Embodied AI Platform (2020) (107)
- A Diagram is Worth a Dozen Images (2016) (106)
- Solving Geometry Problems: Combining Text and Diagram Interpretation (2015) (104)
- Query-Reduction Networks for Question Answering (2016) (101)
- SeGAN: Segmenting and Generating the Invisible (2017) (98)
- Charades-Ego: A Large-Scale Dataset of Paired Third and First Person Videos (2018) (92)
- Actor and Observer: Joint Modeling of First and Third-Person Videos (2018) (91)
- A latent model of discriminative aspect (2009) (85)
- Diagram Understanding in Geometry Questions (2014) (82)
- Discovering Neural Wirings (2019) (80)
- Video Relationship Reasoning Using Gated Spatio-Temporal Energy Graph (2019) (74)
- LCNN: Lookup-Based Convolutional Neural Network (2016) (72)
- Object-Centric Anomaly Detection by Attribute-Based Reasoning (2013) (72)
- Watching the World Go By: Representation Learning from Unlabeled Videos (2020) (68)
- Neural Speed Reading via Skim-RNN (2017) (66)
- VisualCOMET: Reasoning About the Dynamic Context of a Still Image (2020) (64)
- Deep Classifiers from Image Tags in the Wild (2015) (61)
- Scene Discovery by Matrix Factorization (2008) (56)
- Aligning ASL for Statistical Translation Using a Discriminative Word Model (2006) (55)
- MERLOT RESERVE: Neural Script Knowledge through Vision and Language and Sound (2022) (54)
- Modeling for diversifying electricity supply by maximizing renewable energy use in Ebino city southern Japan (2017) (52)
- Two Body Problem: Collaborative Visual Task Completion (2019) (51)
- Generating Notifications for Missing Actions: Don't Forget to Turn the Lights Off! (2015) (51)
- Imagine This! Scripts to Compositions to Videos (2018) (49)
- Re3 : Real-Time Recurrent Regression Networks for Object Tracking (2017) (49)
- PhotoShape: Photorealistic Materials for Large-Scale Shape Collections (2018) (48)
- Stating the Obvious: Extracting Visual Common Sense Knowledge (2016) (46)
- Multi-attribute Queries: To Merge or Not to Merge? (2013) (46)
- Using Classification to Protect the Integrity of Spectrum Measurements in White Space Networks (2011) (45)
- The benefits and challenges of collecting richer object annotations (2010) (45)
- Phrase-Indexed Question Answering: A New Challenge for Scalable Document Comprehension (2018) (45)
- Grounded Situation Recognition (2020) (44)
- Incorporating Scene Context and Object Layout into Appearance Modeling (2014) (43)
- Salient Montages from Unconstrained Videos (2014) (42)
- Towards Transparent Systems: Semantic Characterization of Failure Modes (2014) (41)
- Adding Unlabeled Samples to Categories by Learned Attributes (2013) (41)
- ELASTIC: Improving CNNs With Dynamic Scaling Policies (2018) (40)
- Who Let the Dogs Out? Modeling Dog Behavior from Visual Data (2018) (40)
- Visalogy: Answering Visual Analogy Questions (2015) (38)
- Are Elephants Bigger than Butterflies? Reasoning about Sizes of Objects (2016) (37)
- Butterfly Transform: An Efficient FFT Based Neural Architecture Design (2019) (37)
- ProcTHOR: Large-Scale Embodied AI Using Procedural Generation (2022) (35)
- See the Glass Half Full: Reasoning About Liquid Containers, Their Volume and Content (2017) (35)
- Learning Neural Network Subspaces (2021) (34)
- A Cordial Sync: Going Beyond Marginal Policies for Multi-Agent Embodied Tasks (2020) (34)
- It's All About the Data (2010) (33)
- Use the Force, Luke! Learning to Predict Physical Forces by Simulating Effects (2020) (33)
- Commonly Uncommon: Semantic Sparsity in Situation Recognition (2016) (32)
- LanguageRefer: Spatial-Language Model for 3D Visual Grounding (2021) (32)
- PIGLeT: Language Grounding Through Neuro-Symbolic Interaction in a 3D World (2021) (31)
- AJILE Movement Prediction: Multimodal Deep Learning for Natural Human Neural Recordings and Video (2017) (31)
- Action Recognition in the Presence of One Egocentric and Multiple Static Cameras (2014) (30)
- Patching open-vocabulary models by interpolating weights (2022) (25)
- Structured Set Matching Networks for One-Shot Part Labeling (2017) (25)
- DOCK: Detecting Objects by Transferring Common-Sense Knowledge (2018) (24)
- Much Ado About Time: Exhaustive Annotation of Temporal Data (2016) (24)
- Query-Regression Networks for Machine Comprehension (2016) (21)
- Segment-Phrase Table for Semantic Segmentation, Visual Entailment and Paraphrasing (2015) (21)
- Conditional Driving from Natural Language Instructions (2019) (20)
- What Should I Do Now? Marrying Reinforcement Learning and Symbolic Planning (2019) (16)
- Learning Generalizable Visual Representations via Interactive Gameplay (2021) (16)
- Learning to Select and Order Vacation Photographs (2015) (15)
- TuringAdvice: A Generative and Dynamic Evaluation of Language Use (2021) (15)
- What does a platypus look like? Generating customized prompts for zero-shot image classification (2022) (14)
- Editing Models with Task Arithmetic (2022) (14)
- Building a dictionary of image fragments (2012) (14)
- Summarizing Unconstrained Videos Using Salient Montages (2017) (14)
- Object Categorization: Words and Pictures: Categories, Modifiers, Depiction, and Iconography (2009) (13)
- Probing Text Models for Common Ground with Visual Representations (2020) (13)
- Multi-Resolution Language Grounding with Weak Supervision (2014) (13)
- Evaluating Machines by their Real-World Language Use (2020) (13)
- Phrasal Recognition (2013) (12)
- Unlabeled Data Improves Word Prediction (2009) (12)
- On the Application of Genetic Programming for New Generation of Ground Motion Prediction Equations (2015) (12)
- Enabling AI at the edge with XNOR-networks (2020) (12)
- Probing Contextual Language Models for Common Ground with Visual Representations (2020) (11)
- Semantic Understanding of Professional Soccer Commentaries (2012) (10)
- Toward a Taxonomy and Computational Models of Abnormalities in Images (2015) (10)
- Spectral acceleration prediction using genetic programming based approaches (2021) (10)
- Unlabeled data improvesword prediction (2009) (10)
- Discriminative and consistent similarities in instance-level Multiple Instance Learning (2015) (10)
- Visual Reaction: Learning to Play Catch With Your Drone (2019) (9)
- Assessing the Applicability of Ground‐Motion Models for Induced Seismicity Application in Central and Eastern North America (2018) (9)
- In the Wild: From ML Models to Pragmatic ML Systems (2020) (9)
- Artificial Agents Learn Flexible Visual Representations by Playing a Hiding Game (2019) (9)
- Ranking Highlights in Personal Videos by Analyzing Edited Videos (2016) (9)
- Layer-Wise Data-Free CNN Compression (2020) (9)
- Semantic Highlight Retrieval and Term Prediction (2017) (9)
- Pushing it out of the Way: Interactive Visual Navigation (2021) (9)
- Objaverse: A Universe of Annotated 3D Objects (2022) (8)
- Retrospectives on the Embodied AI Workshop (2022) (8)
- Non-Driven wheels Application for Intelligent Multi-Objective Control of Hybrid Vehicles (2012) (7)
- Visual Commonsense Graphs: Reasoning about the Dynamic Context of a Still Image (2020) (6)
- Forward Compatible Training for Large-Scale Embedding Retrieval Systems (2021) (6)
- Are We Overfitting to Experimental Setups in Recognition (2020) (6)
- LLC: Accurate, Multi-purpose Learnt Low-dimensional Binary Codes (2021) (6)
- Image Classification and Retrieval from User-Supplied Tags (2014) (5)
- ELASTIC: Improving CNNs with Instance Specific Scaling Policies (2018) (5)
- Assessing Predictive Capability of Ground‐Motion Models for Probabilistic Seismic Hazard in Iran (2019) (5)
- Semantic highlight retrieval (2016) (5)
- Lo-fi: Distributed Fine-tuning without Communication (2022) (4)
- A Task-Oriented Approach for Cost-Sensitive Recognition (2016) (4)
- Object Manipulation via Visual Target Localization (2022) (3)
- Image segmentation via local higher order statistics (2003) (3)
- Break and Make: Interactive Structural Understanding Using LEGO Bricks (2022) (3)
- Exposing the Limits of Video-Text Models through Contrast Sets (2022) (3)
- LCS: Learning Compressible Subspaces for Adaptive Network Compression at Inference Time (2021) (2)
- Object Goal Navigation with End-to-End Self-Supervision (2022) (2)
- Neural Radiance Field Codebooks (2023) (2)
- The Introspective Agent: Interdependence of Strategy, Physiology, and Sensing for Embodied Agents (2022) (2)
- How to tell the difference between a cat and a dog? (2006) (2)
- RangeAugment: Efficient Online Augmentation with Range Learning (2022) (2)
- Matryoshka Representation Learning (2022) (2)
- Matryoshka Representations for Adaptive Deployment (2022) (2)
- A comprehensive analysis on a novel DC‐Excited Flux‐Switching Linear Motor as a new linear vehicle for transportation systems (2022) (1)
- FastFill: Efficient Compatible Model Update (2023) (1)
- Phone2Proc: Bringing Robust Robots Into Our Chaotic World (2022) (1)
- FLUID: A Unified Evaluation Framework for Flexible Sequential Data (2020) (1)
- LegoTron: An Environment for Interactive Structural Understanding (2021) (1)
- Forward Compatible Training for Representation Learning (2021) (1)
- Iconary: A Pictionary-Based Game for Testing Multimodal Communication with Drawings and Text (2021) (1)
- What Can You Learn from Your Muscles? Learning Visual Representation from Human Interactions (2020) (1)
- A Systematic Approach for News Caption Generation (2014) (1)
- Hydraulic anti-lock, anti-skid braking system using fuzzy controller (2016) (1)
- PhotoShape (2018) (1)
- Transferring Common-Sense Knowledge for Object Detection (2018) (1)
- Designing representational architectures in recognition (2011) (1)
- Appendix : Asynchronous Temporal Fields for Action Recognition (2017) (0)
- Linguistic performance in Epileptic patients under treatment with old generation of anti-epileptic drugs (2015) (0)
- Stable and low-precision training for large-scale vision-language models (2023) (0)
- ALERT: Predicting Failures (Supplementary Material) (2014) (0)
- Modeling and Simulation ofElectromagnetic Conducted EmissionDue toPower Electronics (2006) (0)
- Expanding Training Sets with Unlabeled Samples by Learned Attributes (2013) (0)
- Solving geometry problems (2015) (0)
- Abnormal Object Recognition: A Comprehensive Study (2014) (0)
- THE STUDY AND DETERMINATION OF DEPTH - AREA - DURATION (DAD) OF RAINFALL IN HAMEDAN, ZANJAN AND QAZVIN PROVINCES (2011) (0)
- DataComp: In search of the next generation of multimodal datasets (2023) (0)
- Higher Order Statistics in Computer Vision (2002) (0)
- Higher order statistics in computer vision: analysis of images and detection of extraneous objects in images. (2002) (0)
- How to Tell the Difference Between a Dog and a Cat (2004) (0)
- A New Method for Eye Printing (2004) (0)
- An application of linear predictive coding and computational geometry to iris recognition (2006) (0)
- Probing Language Models for Common Ground with Visual Representations (2020) (0)
- Reinforce Data, Multiply Impact: Improved Model Accuracy and Robustness with Dataset Reinforcement (2023) (0)
- Final NeurIPS 2022 Conference Paper 779 Reviewer DT 9 D Comment (2022) (0)
- Appreciation to IJCV Reviewers (2012) (0)
- Towards Multimodal Multitask Scene Understanding Models for Indoor Mobile Agents (2022) (0)
- It's All About the Data This paper explains how training data is important for many computer vision algorithms and presents case studies of how the Internet can be used to obtain high-quality data. (2010) (0)
- Toward visual intelligence (2017) (0)
- Detecting Strange Objects via Visual Attributes (2014) (0)
- Moving Forward by Moving Backward: Embedding Action Impact over Action Semantics (2023) (0)
- Self-Supervised Object Goal Navigation with In-Situ Finetuning (2022) (0)
- Learning and transferring movie styles (2017) (0)
- Calibration and assessment of synthetic unit hydrograph construction methods in Ekbatan Dam watershed in Hamedan. (2007) (0)
This paper list is powered by the following services: