(1) Overview
Context
Megalithic monuments constitute a major cultural component of the Vindhyan highlands, where they are widely distributed across a large and varied landscape [1, 2, 3, 4, 5, 6]. The region’s diverse geographical and geomorphological features—such as plateaus, ridges, valleys, and river basins [7, 8, 9], created favorable conditions for the growth and continuity of megalithic communities. More than 200 megalithic sites have been identified between Sasaram in Bihar and Banda in Uttar Pradesh [4, 5, 6, 10, 11], forming a continuous cultural zone that reflects the long-term development of megalithic traditions in north-central India.
Despite this wide distribution, detailed data on Vindhyan megalithic sites is largely unavailable. Most previous research has been limited to surface reporting and general descriptions, without systematic recording of spatial, physical, or contextual attributes. No study has used GIS-based approach to analyze the relationship between megalithic monuments and their surrounding landscape. This lack of structured, accessible data has restricted comparative, analytical, and regional-scale interpretations.
The present dataset from Dantari Hill, located in Mirzapur District, Uttar Pradesh (Figure 1), aims to address this gap. The area contains a combination of megalithic monuments, microlithic scatters, painted rock shelters, and habitation remains. Field documentation and GIS-based spatial analysis were used to record the physical characteristics, spatial patterns, and environmental settings of these features. The dataset provides standardized and open-access data that can support future research on settlement organization, cultural development, and heritage management in the Vindhyan region.

Figure 1
Location map of Dantari Hill, Mirzapur District, Uttar Pradesh, India.
Spatial coverage
Description: Dantari Hill, Mirzapur District, Uttar Pradesh, India.
Northern boundary: 25.041597, 82.899264
Southern boundary: 25.019783, 82.910362
Eastern boundary: 25.023945, 82.91638507
Western boundary: 25.029957, 82.89535856
Temporal Coverage
The dataset represents cultural remains spanning a long period from the Early Iron Age to the Historic period, approximately first millennium BCE to the fifth century CE. Although no absolute dates are available for the Dantari Hill complex, its chronology is inferred from the material evidence and its comparison with the excavated and reported sites in South Asia [2, 3, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 5, 29, 30, 31, 32, 33, 34, 35, 36, 12, 13, 14, 15, 16, 17, 18].
(2) Methods
Steps
The dataset was generated through systematic field surveys and GIS-based spatial analysis of megalithic, microlithic, rock art, and habitation features at Dantari Hill. To create a comprehensive and structured record of the megalithic assemblage, a micro-documentation approach was adopted. This approach focused on spatial, morphological, and contextual data for each individual megalithic structure. Initial reconnaissance involved visual inspection of all visible megalithic monuments to assess their distribution, preservation status, and landscape setting. Based on this preliminary assessment, a suitable documentation strategy was developed. Each monument was assigned a unique identification number, marked non-invasively on or near the structure, to ensure systematic cataloguing and traceability during data collection and analysis. Precise geographical coordinates (latitude, longitude, and elevation) were recorded from each structure central point using a handheld differential GPS with an accuracy of ±3 m. Physical attributes of the megalithic monuments were systematically documented. Megalithic typology was classified based on surface-visible morphological characteristics into three categories: cairn, cairn with cist, and cairn with menhir. A cairn is defined as a barrow made of stones mixed with soil in a circular formation. A cairn with cist consists of a similar barrow having stones cist (chamber) usually at the center. A cairn with menhir is characterised by the presence of one vertical (upright) stone at the center of cairn. The size of each megalithic structure was measured using measuring tape along two perpendicular axes, east–west (x) and north–south (y), representing the diameter. Based on these measurements, megaliths were classified into three size categories: small (below 5 m), medium (5–10 m), and large (above 10 m). The cairn deposit was measured using a theodolite and spirit level from the surrounding ground surface. Orientation of the monuments was determined using a magnetic compass. The preservation status of each megalithic burial was assessed through visual field observation and classified into three categories: intact, disturbed, and completely disturbed. Intact burials refer to structure with no visible signs of damage. Disturbed megalithic burials show evidence of disturbance, usually in the form of digging and dislocation of deposit. Completely disturbed burials are mainly dug at the center, deposit of megalith has vanished and only scattered remains of megaliths are visible.
Spatial analysis was conducted using ArcGIS 10.8.2 to examine the relationship between archaeological features and the surrounding landscape. A Digital Elevation Model (DEM) derived from the Shuttle Radar Topography Mission (SRTM) 1 Arc-Second Global dataset (Entity ID: SRTM1 N25E082V3; resolution ~30 m) was downloaded from the USGS Earth Explorer portal and used to generate elevation and contour map. All spatial datasets were projected to the WGS 1984 UTM Zone 44 N coordinate reference system. Aster and vector layers were integrated to create thematic maps representing monument distribution, slope gradient, hydrology, and lithology. All analyses were limited to a 2 km buffer zone around Dantari Hill to assess subtle environmental associations. Density and spatial organisation of megalithic monuments were analyzed using Kernel Density Estimation (KDE) and buffer analysis. A three-dimensional (3D) terrain model was generated using the ArcScene module of ArcGIS 10.8.2 to visualise elevation and slope variation across the study area. Raster analyses were performed using a 30 m cell size to match the DEM resolution.
Microlithic materials were documented through both random and grid sampling strategies. Rock art panels were documented using a micro-level approach, recording their dimensions (length × width × height), orientation, number of depictions, height from the present ground surface, and color. Habitation areas were examined through cross-transect survey methods to identify artifact scatters and structural traces. The field data were collected between 6 February and 9 March 2022 by two of the authors through systematic survey.
Sampling strategy
A total coverage strategy was used to document all visible archaeological features at Dantari Hill. Megalithic structures were recorded systematically through on-site inspection and spatial mapping. Microliths were collected using random and grid methods: random method for microliths within the megalithic, and 10 × 10 m grids in artifact-dense areas. Rock art panels and habitation traces were recorded exhaustively using micro-level and cross-transect approaches, ensuring representative coverage of all site types and environmental zones.
Quality Control
GPS readings were cross-verified with satellite and topographic base maps to minimize locational error. Repeated measurements and standardized recording sheets were used for all physical attributes. Data entry was double-checked before integration into GIS to ensure consistency and accuracy.
Constraints
Extensive stone quarrying and human vandalism has led to the destruction of many megalithic monuments, making it difficult to identify their original morphological features. Several rock art panels are heavily faded or overlain by modern inscriptions and soot deposits, obscuring painted motifs and complicating interpretation. In habitation areas, agricultural activities have cut through large portions of the mound, disturbing its original form, stratigraphic deposits, and material remains. These factors collectively limited the site visibility, preservation quality, and the accuracy of contextual documentation.
(3) Dataset
3.1. Megalithic Dataset
This dataset contains comprehensive field documentation of 482 megalithic monuments recorded at Dantari Hill.
3.1.1. Data type
Primary data and processed data
3.1.2. Format names and versions
Microsoft Excel (.xlsx; Dantari_Megalithic.xlsx), ESRI Shapefile (.shp; Dantari_Spatial_Data.shp), Keyhole Markup Language (.kml; Dantari_Megalithic.kml), and raster outputs (.tif; Dantari_Topography.tif, Dantari_Slope.tif, Dantari_Density.tif, Dantari_Visibility.tif). All GIS data were created and analyzed using ArcGIS 10.8.2 and are compatible with later versions. Maps showing distribution, elevation, slope, density, and visibility (Figure 2) are provided in PDF and JPEG (.jpg) formats for general viewing.

Figure 2
GIS analysis reviled the distribution of megalithic monuments at Dantari Hill. (A) Cluster analysis (B) Elevation analysis (C) Density analysis (D) Viewshed analysis (E) Slop analysis.
3.1.3. Dataset Description
The Excel file contains detailed records of all 482 megalithic monuments, each assigned a unique identification number. Recorded attributes include cluster name, typology, latitude, longitude, altitude, dimensions (diameter, height, stone size), current deposit, and preservation condition. These variables were derived from systematic field survey and spatial documentation using handheld differential GPS and standard measuring instruments. The dataset provides a structured quantitative record that supports analysis of monument distribution, morphology, and preservation. It serves as a baseline for comparative studies of megalithic architecture and cultural landscapes in the Vindhya region and broader South Asian context.
3.2. Microlithic Dataset
3.2.1. Data type
Primary data and processed data.
3.2.2. Format names and versions
Microsoft Excel (.xlsx; Dantari_Microlithic.xlsx), Graph files (.png, .pdf), and JPEG images (.jpg; Microlith_Photoset). Compatible with Microsoft Excel 2016 and later versions.
3.2.3. Dataset Description
This dataset documents microlithic artifacts recovered from the Dantari Hill complex using random and grid sampling methods. Random finds were recorded in situ with contextual notes and photographs, while grid sampling employed 10 × 10 m trenches in high-density clusters. The Excel file provides detailed records, including artifact identification number, locational association, type, raw material, size, morphological measurements and weight. Graphs present quantitative distributions and frequency patterns, while photographs provide visual references for each artifact type. Together, these data support comparative studies on microlith typology, technology, raw material selection, and spatial patterning within the Vindhyan region.
3.3. Rock Art Dataset
3.3.1. Data Type
Primary data and interpretation of data.
3.3.2. Format names and versions
Microsoft Excel (.xlsx; Dantari_RockArt.xlsx), MS Word file, Graph and Chart files (.png, .pdf), and JPEG images (.jpg; RockArt_Photoset). KML file (.kml; Dantari_RockArt.kml) for spatial visualization. Compatible with Microsoft Excel 2016 and later versions.
3.3.3. Dataset Description
This dataset documents the physical and spatial attributes of painted rock shelters within the Dantari Hill complex. The Excel file records panel identification numbers, dimensions (length × width × height), orientation, depiction count, pigment, and height from the current ground surface. Graphs and charts summarize quantitative data such as motif frequency, pigment use, and orientation trends. High-resolution photographs and figures illustrate preservation conditions, including fading, overlapping, and soot or engraving marks. The dataset provides a baseline for studying rock art distribution, stylistic variation, and conservation status across the Vindhyan landscape.
3.4. Habitation Dataset
3.4.1. Data Type
Primary data and processed data.
3.4.2. Format Names and Versions
Microsoft Excel (.xlsx), JPEG (.jpg), and ESRI Shapefile (.shp).
3.4.3. Dataset Description
This dataset documents habitation sites identified through cross-transect survey across the Dantari Hill landscape. Each entry includes site location coordinates, contour, extent, surface material type, and disturbance indicators. Photographs and GIS layers illustrate site distribution and current land use. The dataset provides valuable information for analyzing settlement organization, site function, and the relationship between habitation zones, resource areas, and megalithic features. It serves as a reference for comparative landscape and cultural studies in northern India.
Creation dates
Field data were collected between 6 February 2022 and 9 March 2022, and the GIS datasets were prepared and updated between 2022 to 2025.
Dataset Creators
Dr. Virag Gopal Sontakke – Assistant Professor, Department of AIHC & Archaeology, Banaras Hindu University. Project leader and administrator; supervised fieldwork planning, research design, and overall coordination of the Dantari Hill documentation project.
Dheeraj Sharma – Research Scholar, Department of AIHC & Archaeology, Banaras Hindu University. Principal investigator responsible for field survey execution, data compilation, GIS analysis, and preparation of the complete dataset.
Dr. Prachi Virag Sontakke: Assistant Professor, Department of AIHC & Archaeology, Banaras Hindu University. Provided expertise and supervision in the documentation and interpretation of microlith datasets and the editing of the manuscript.
Deepesh Singh: Research Scholar, Department of AIHC & Archaeology, Banaras Hindu University. Contributed to habitation recording, contextual data collection, and artifact documentation.
Dr. Sachin Kumar Tiwary – Assistant Professor, Department of AIHC & Archaeology, Banaras Hindu University. Provided expertise and supervision in the documentation and interpretation of rock art datasets.
Shubham Saurabh – Research Fellow, Nava Nalanda Mahavihara, Nalanda, Bihar. Assisted in field survey operations and documentation of megalithic, microliths and rock art data.
Dhamma Ratna – Research Fellow, Nava Nalanda Mahavihara, Nalanda, Bihar. Contributed to field recording, contextual data collection, and artifact documentation.
Language
English
License
CC BY 4.0
Repository location
The dataset supporting this study is openly available in the Figshare Repository under the title “A Detailed Archaeological Dataset of the Megalithic Complex at Vindhyan Highlands: A Case Study of Dantari Hill, India”, accessible via DOI: 10.6084/m9.figshare.30639542.
Publication date
30-10-2025
(4) Reuse potential
This dataset can be reused by archaeologists, geographers, and heritage researchers to study how people lived and built monuments in the Vindhya region. It can support new GIS-based spatial studies, comparisons with other megalithic and rock art sites, and teaching in archaeological documentation methods. The data can also help in mapping, conservation planning, and understanding cultural landscapes. Because it includes both physical and spatial information, it can be combined easily with other regional datasets for research, education, and heritage management projects.
Acknowledgements
We sincerely thank the people of Baheri village for their kind support during the fieldwork at Dantari Hill. We are grateful to the saints and members of Kabir Ashram for providing accommodation and help during the survey. Our special thanks to Dhamm Ratna and Shubham Saurabh for their valuable assistance in the field, and to Dr. Sachin Kumar Tiwary for his guidance and support throughout the documentation process.
Competing Interests
The authors have no competing interests to declare.
