Soumen Chakrabarti
#109,631
Most Influential Person Now
Indian engineer
Soumen Chakrabarti's AcademicInfluence.com Rankings
Soumen Chakrabarti's Computer Science Degrees
- Computer Science: World Rank #4951, Historical Rank #5230
- Data Mining: World Rank #224, Historical Rank #225
- Algorithms: World Rank #377, Historical Rank #382
- Database: World Rank #6319, Historical Rank #6551

Computer Science
Soumen Chakrabarti's Degrees
- PhD Computer Science University of California, Berkeley
Why Is Soumen Chakrabarti Influential?
According to Wikipedia, Soumen Chakrabarti is an Indian computer scientist and professor in the Department of Computer Science and Engineering at IIT Bombay. He is known for his work on:
- The CLEVER Web page ranking system based on hyperlinks, related to PageRank.
- Focused crawlers, which are Web crawlers guided by page topic classifiers.
- Keyword search on graph databases, later popularized by Facebook graph search.
- Named entity disambiguation in Web text.
He is the author of an early book on Web search and mining.
Soumen Chakrabarti's Published Works
Number of citations in a given year to any of this author's works
Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author
Published Works
- Focused Crawling: A New Approach to Topic-Specific Web Resource Discovery (1999) (1818)
- Keyword searching and browsing in databases using BANKS (2002) (1061)
- Enhanced hypertext categorization using hyperlinks (1998) (942)
- Automatic Resource Compilation by Analyzing Hyperlink Structure and Associated Text (1998) (837)
- Mining the Web's Link Structure (1999) (585)
- Bidirectional Expansion For Keyword Search on Graph Databases (2005) (575)
- The Morgan Kaufmann Series in Data Management Systems (1999) (526)
- Collective annotation of Wikipedia entities in web text (2009) (491)
- Annotating and searching web tables using entities, types and relationships (2010) (421)
- Data mining for hypertext: a tutorial survey (2000) (344)
- Generalizing Across Domains via Cross-Gradient Training (2018) (338)
- Mining the web - discovering knowledge from hypertext data (2002) (312)
- Accelerated focused crawling through online relevance feedback (2002) (283)
- Scalable feature selection, classification and signature generation for organizing large text databases into hierarchical topic taxonomies (1998) (280)
- Flow and stretch metrics for scheduling continuous job streams (1998) (279)
- Dynamic personalized pagerank in entity-relation graphs (2007) (231)
- BANKS: Browsing and Keyword Searching in Relational Databases (2002) (184)
- Parallel randomized load balancing (1995) (180)
- Integrating the document object model with hyperlinks for enhanced topic distillation and information extraction (2001) (178)
- Mining Surprising Patterns Using Temporal Description Length (1998) (163)
- Using Taxonomy, Discriminants, and Signatures for Navigating in Text Databases (1997) (158)
- Enhanced topic distillation using text, markup tags, and hyperlinks (2001) (151)
- Improved Scheduling Algorithms for Minsum Criteria (1996) (149)
- The structure of broad topics on the web (2002) (146)
- Fast and accurate text classification via multiple linear discriminant projections (2003) (139)
- Structured learning for non-smooth ranking losses (2008) (121)
- Learning to rank networked entities (2006) (111)
- Collective Entity Resolution with Multi-Focal Attention (2016) (104)
- Data Mining - Know It All (2008) (99)
- Keyword Search in Databases (2007) (92)
- Shuffling a Stacked Deck: The Case for Partially Randomized Ranking of Search Engine Results (2005) (90)
- Modeling the benefits of mixed data and task parallelism (1995) (84)
- Distributed Hypertext Resource Discovery Through Examples (1999) (83)
- Optimizing scoring functions and indexes for proximity search in type-annotated corpora (2006) (81)
- Proceedings of the 19th international conference on World wide web (2010) (77)
- Global communication analysis and optimization (1996) (74)
- Monitoring the dynamic web to respond to continuous queries (2003) (73)
- Document Classification Through Interactive Supervision of Document and Term Labels (2004) (69)
- Scaling multi-class support vector machines using inter-class confusion (2002) (67)
- Fast algorithms for topk personalized pagerank queries (2008) (67)
- Enhanced Answer Type Inference from Questions using Sequential Models (2005) (66)
- Implementing an irregular application on a distributed memory multiprocessor (1993) (65)
- Learning random walks to rank nodes in graphs (2007) (63)
- Proceedings of the 19th International Conference on World Wide Web (WWW 2010) (2010) (62)
- Cross-training: learning probabilistic mappings between topics (2003) (62)
- Is question answering an acquired skill? (2004) (62)
- OpenIE6: Iterative Grid Labeling and Coordination Analysis for Open Information Extraction (2020) (53)
- Surfing the Web Backwards (1999) (52)
- Learning a Linear Influence Model from Transient Opinion Dynamics (2014) (51)
- Open-domain quantity queries on web tables: annotation, response, and consensus models (2014) (50)
- Learning joint query interpretation and response ranking (2012) (49)
- Similarity and Clustering (2003) (48)
- Earth Mover's Distance Pooling over Siamese LSTMs for Automatic Short Answer Grading (2017) (46)
- Knowledge Graph and Corpus Driven Segmentation and Answer Inference for Telegraphic Entity-seeking Queries (2014) (44)
- IMoJIE: Iterative Memory-Based Joint Open Information Extraction (2020) (43)
- Question Answering Over Temporal Knowledge Graphs (2021) (41)
- The influence of search engines on preferential attachment (2005) (41)
- Breaking Through the Syntax Barrier: Searching with Entities and Relations (2004) (40)
- Discriminative Link Prediction using Local, Community, and Global Signals (2016) (39)
- Index design and query processing for graph conductance search (2011) (38)
- Randomized load balancing for tree-structured computation (1994) (38)
- Spectral filtering for resource discovery (1998) (37)
- Type-Sensitive Knowledge Base Inference Without Explicit Type Supervision (2018) (36)
- Resource scheduling for parallel database and scientific applications (1996) (34)
- Neural architecture for question answering using a knowledge graph and web corpus (2017) (34)
- Adaptive control for packet video (1994) (34)
- Learning Parameters in Entity Relationship Graphs from Ranking Preferences (2006) (32)
- Complex Program Induction for Querying Knowledge Bases in the Absence of Gold Programs (2019) (30)
- Temporal Knowledge Base Completion: New Algorithms and Evaluation Protocols (2020) (29)
- First ground‐based measurements of OI 6300 Å daytime aurora over Boston in response to the 30 October 2003 geomagnetic storm (2004) (28)
- Learning to rank for quantity consensus queries (2009) (28)
- Runtime Support for Portable Distributed Data Structures (1995) (27)
- Using Memex to archive and mine community Web browsing experience (2000) (26)
- Models and Scheduling Algorithms for Mixed Data and Task Parallel Programs (1997) (26)
- Recent results in automatic Web resource discovery (1999) (26)
- A Deep Generative Model for Code-Switched Text (2019) (25)
- SCAD: collective discovery of attribute values (2011) (25)
- Chapter 57 – Fast and accurate text classification via multiple linear discriminant projections (2002) (23)
- Neural Program Induction for KBQA Without Gold Programs or Query Annotations (2019) (23)
- Multipol: A Distributed Data Structure Library (1995) (21)
- Focused Web Crawling (2009) (21)
- Enhancing Search with Structure (2010) (20)
- Discriminative Link Prediction Using Local Links, Node Features and Community Structure (2013) (20)
- A Two-Stage Framework for Computing Entity Relatedness in Wikipedia (2017) (19)
- Federated Database Systems (2009) (18)
- Biography and Position Statement. (2010) (18)
- SPIN: searching personal information networks (2005) (17)
- Compressed data structures for annotated web search (2012) (17)
- On Computing Entity Relatedness in Wikipedia, with Applications (2020) (16)
- Distributed data structures and algorithms for Gröbner basis computation (1994) (16)
- Deep Exogenous and Endogenous Influence Combination for Social Chatter Intensity Prediction (2020) (16)
- Diversity in ranking via resistive graph centers (2011) (16)
- Improved approximation algorithms for minsum criteria (1996) (16)
- Topic Distillation and Spectral Filtering (1999) (14)
- New Embedded Representations and Evaluation Protocols for Inferring Transitive Relations (2018) (13)
- Memex: A Browsing Assistant for Collaborative Archiving and Mining of Surf Trails (2000) (13)
- Relay-Linking Models for Prominence and Obsolescence in Evolving Networks (2016) (13)
- Knowledge Base Completion: Baseline strikes back (Again) (2020) (12)
- Keyword Search in Databases (2001) (12)
- Select, Substitute, Search: A New Benchmark for Knowledge-Augmented Visual Question Answering (2021) (12)
- Web-CAM: monitoring the dynamic Web to respond to continual queries (2004) (12)
- Index Design for Dynamic Personalized PageRank (2008) (12)
- Improved Sentiment Detection via Label Transfer from Monolingual to Synthetic Code-Switched Text (2019) (12)
- Social media: source of information or bunch of noise (2011) (11)
- On the Correctness of a Distributed Memory Gröbner basis Algorithm (1993) (11)
- Mining Themes From Bookmarks (2000) (11)
- GIRNet: Interleaved Multi-Task Recurrent State Sequence Models (2018) (11)
- Scene Graph based Image Retrieval - A case study on the CLEVR Dataset (2019) (10)
- User Interaction in the BANKS System. (2003) (9)
- Web-scale entity-relation search architecture (2011) (9)
- Sulphide free unhairing - Studies on ozone based depilation (2006) (9)
- Data Structures for Irregular Applications (1993) (9)
- Learning Linear Influence Models in Social Networks from Transient Opinion Dynamics (2019) (9)
- Short ranged attraction and long ranged repulsion between two solute particles in a subcritical liquid solvent (2006) (9)
- Multi-task Learning for Target-dependent Sentiment Classification (2019) (8)
- Portable Parallel Irregular Applications (1995) (8)
- Learning to Rank in Vector Spaces and Social Networks (2007) (8)
- Curating and Searching the Annotated Web (2009) (7)
- Special techniques for synthesis of high solid resins and applications in surface coatings (2003) (7)
- The Kauwa-Kaate Fake News Detection System: Demo (2020) (6)
- Adversarial Permutation Guided Node Representations for Link Prediction (2020) (6)
- Proceedings of the 2008 International Conference on Web Search and Data Mining (2008) (6)
- Task-Specific Representation Learning for Web-Scale Entity Disambiguation (2018) (6)
- Parallel randomized load balancing (Preliminary Version). (1995) (6)
- Automated Early Leaderboard Generation from Comparative Tables (2018) (6)
- Conditional Models for Non-smooth Ranking Loss Functions (2009) (6)
- A Frequency Offset Estimation Scheme for OFDM Based UWB Systems (2006) (5)
- Joint Bootstrapping of Corpus Annotations and Entity Types (2013) (5)
- Fuzzy MCDM (2009) (5)
- Interpretable Complex Question Answering (2020) (5)
- Deep Neural Matching Models for Graph Retrieval (2020) (4)
- Discovering Links Between Lexical and Surface Features in Questions and Answers (2004) (4)
- User interaction in the BANKS system: a demonstration (2003) (4)
- Data mining for hypertext (tutorial session) (title only) (2000) (4)
- Interpretable Neural Subgraph Matching for Graph Retrieval (2022) (4)
- Mitigating the Effect of Out-of-Vocabulary Entity Pairs in Matrix Factorization for KB Inference (2018) (4)
- Features and Aggregators for Web-scale Entity Search (2013) (4)
- Analysis of Reference and Citation Copying in Evolving Bibliographic Networks (2019) (3)
- Chapter 5 – Supervised Learning (2003) (3)
- The UV-VIS spectrometer for the ExoMars mission (2006) (3)
- Guest Editors' Introduction: Special Section on Mining and Searching the Web (2004) (3)
- Web-scale entity annotation using MapReduce (2013) (3)
- False Negative Rate (2009) (3)
- Analyzing Fine-grained Hypertext Features for Enhanced Crawling and Topic Distillation (2002) (3)
- Hypertext databases and data mining (1999) (3)
- Privacy Preserving Link Prediction with Latent Geometric Network Models (2019) (2)
- Mortality of Dalbergia sisso Roxb. (Shisham) in Subathu Forest Range of Solan, Himachal Pradesh: a Case Study (2004) (2)
- Integrating Transductive and Inductive Embeddings Improves Link Prediction Accuracy (2021) (2)
- Understanding Error Control Coding (1994) (2)
- Large-scale Mortality of Willow in Lahaul Valley, District Lahaul & Spiti, Himachal Pradesh (2003) (2)
- HIClass: Hyper-interactive Text Classification by Interactive Supervision of Document and Term Labels (2004) (2)
- Differentially Private Link Prediction with Protected Connections (2020) (2)
- Interactive Focused Crawler: Setup, Monitoring and Control through User Feedback (2003) (2)
- Accelerating Newton Optimization for Log-Linear Models through Feature Redundancy (2006) (2)
- "Open-domain question answering using a knowledge graph and web corpus" by Uma Sawant, Soumen Chakrabarti and Ganesh Ramakrishnan with Martin Vesely as coordinator (2018) (2)
- Chapter 2 - Crawling the Web (2003) (1)
- Ranking State-of-the-art Papers via Incomplete Tournaments Induced by Citations from Performance Tables (2018) (1)
- Exploiting the dynamic networking effects of the web (2005) (1)
- Fully Temporal Relation (2009) (1)
- Impact of Fading Correlation on Adaptive Array in Cooperative Relay Networks (2009) (1)
- Chapter 9 – The Future of Web Mining (2003) (1)
- Maximum Common Subgraph Guided Graph Retrieval: Late and Early Interaction Networks (2022) (1)
- Efficient Resource Scheduling in Multiprocessors (1996) (1)
- Hypertext data mining (tutorial AM-1) (2000) (1)
- Web Search and Information Retrieval (2003) (1)
- Fact-Oriented Modeling (2009) (1)
- Fault Tolerant Applications (2009) (1)
- Sic Transit Gloria Manuscriptum: Two Views of the Aggregate Fate of Ancient Papers (2015) (1)
- Joint Autoregressive and Graph Models for Software and Developer Social Networks (2021) (1)
- Data-based research at IIT Bombay (2013) (1)
- Random Allocation of Jobs with Weights and Precedence (1996) (1)
- Diffusion de caracteristiques sur des hyperliens (1999) (0)
- Cytologia 72(4): 419–425, 2007 (2008) (0)
- Chapter 8 – Resource Discovery (2003) (0)
- Studies in jurisprudence and international law (0)
- Topic Sensitive Attention on Generic Corpora Corrects Sense Bias in Pretrained Embeddings (2019) (0)
- MODELIZACION NUMERICA DEL DESARROLLO DE TENSIONES DE RETRACCION EN MATERIALES ESTABILIZADOS PARA FIRMES (2001) (0)
- Review on Practical Ship Design (2000) (0)
- New closed-form bounds on the partition function (2008) (0)
- Neural Estimation of Submodular Functions with Applications to Differentiable Subset Selection (2022) (0)
- Social Network Analysis (2003) (0)
- Web Monitoring for Light-Weight Devices (0)
- NLP Service APIs and Models for Efficient Registration of New Clients (2020) (0)
- Joint Matrix-Tensor Factorization for Knowledge Base Inference (2017) (0)
- Dynamical variability of OI 630.0nm dayglow emissions over low geomagnetic latitudes (2006) (0)
- Session details: Description and Analysis (2002) (0)
- A Question of Identity: What Should Aadhaar Be Like? (0)
- Management and organisation of buddhist art objects in the museums of West Bengal (2006) (0)
- Searching and Mining Fine-Grained Semi-Structured Data (2002) (0)
- Text Search-Enhanced with Types and Entities (2009) (0)
- Web Search results' ranking: PageRank, HITS and related work (2004) (0)
- PANORAMICA DE LOS METODOS DE ESTABILIZACION Y SU COMPORTAMIENTO EN LAS CARRETERAS LOCALES AUSTRALIANAS (2001) (0)
- Automatic Web Resource Discovery (1999) (0)
- Dynamic Personalized Pagerank in Entity-Relation Graphs (2007) (0)
- Searching and Ranking in Entity-Relation Graphs Dual Degree Project Report (2008) (0)
- Parallel Data Structures for Symbolic (1995) (0)
- Proceedings of the 19th International Conference on World Wide Web, WWW 2010, Raleigh, North Carolina, USA, April 26-30, 2010 (2010) (0)
- Making Web-scale Entity-Relationship Search a Reality (2010) (0)
- Hubs and Authorities: Spreading Out and Zooming In (2001) (0)
- Chapter 6 – Semisupervised Learning (2003) (0)
- A Turbo Equalizer with RQLI Encoder (2007) (0)
- Efficient Spatial Representation for Entity-Typing (2017) (0)
- Dramatis Personae of Indian Anthropology Prof. Surajit Chandra Sinha (1926 - 2002) A Great Scholar and a Teacher of Anthropology (2002) (0)
- Performance of trellis coded 8-PSK and 8-DPSK with convolutional interleaver in a fading mobile channel (1994) (0)
- More Accurate Entity Ranking Using Knowledge Graph and Web Corpus (2017) (0)
- Proceedings of the International Conference on Web Search and Web Data Mining, WSDM 2008, Palo Alto, California, USA, February 11-12, 2008 (2008) (0)
- Dwitya biswayuddhakalin Bangla natak o natyasala (1998) (0)
- The Performance Capabilities and Limitations of Gain-Limitated Propulsion Systems (1999) (0)
- SOVA-Based Turbo Equalization and Decoding for Indoor Wireless Channels (2007) (0)
- Incomplete Gamma Integrals for Deep Cascade Prediction Using Content, Network, and Exogenous Signals (2021) (0)
- Kok-boroker Utsa Sandhane (2000) (0)
- Hands-on Space Experiments from Cradle to Grave: The Role of the Sounding Rocket Program in Developing Human Infrastructure (2005) (0)
- A town in the rural milieu Baruipur, West Bengal (2002) (0)
- CROSS-GRADIENT TRAINING (2018) (0)
- Feature transmission via hyperlinks (1999) (0)
- Knowledge Extraction and Inference from Text: Shallow, Deep, and Everything in Between (2018) (0)
- PROTON AND ELECTRON AURORA OVER EISCAT: OPTICAL SIGNATURE AND ASSOCIATED IONOSPHERIC PERTURBATIONS (2003) (0)
What Schools Are Affiliated With Soumen Chakrabarti?
Soumen Chakrabarti is affiliated with the following schools: