Parasayya Telli

Parasayya Telli

Data Engineering Consultant for Digital Transformation

I am thrilled to embark on this entrepreneurial journey as a data engineering consultant. Together, we will empower organizations to unlock the full potential of their data assets. With my expertise in data engineering, personalized solutions, and commitment to excellence, I am confident that I can help businesses thrive in the era of data-driven decision making.

Professional Experience

Data Engineering and Report Generation

  • Delivered datasets collected from remote logs and stored in Elasticsearch
  • Automated generation of reports
  • Integrated external APIs to enhance data utilization

AWS Platform and ETL Development

  • Built APIs and ETL pipelines using AWS services like Lambda, API Gateway, S3, and Elasticsearch
  • Designed end-to-end ETL processes using Elasticsearch, RDS, Data Migration Service, Lambda, and Kafka

ETL Pipelines on Google Cloud Platform

  • Created pipelines using Cloud Data Fusion on GCP
  • Utilized tools like Spark for data preprocessing and Wrangler for data transformation
  • Stored data using BigQuery for further analysis

ETL Pipelines and Data Analytics in Medical Field

  • Developed data pipelines using Python and SQL scripts
  • Designed a Hadoop-based central data warehouse
  • Conducted data analytics using Hive on Hadoop

Data Scraping and Transformation

  • Scraped data from diverse sources including static websites, dynamic websites, and mobile apps
  • Transformed scraped data into client-specified formats for analysis

Automated Data Dumping Process

  • Created pipelines using StreamSets for data dumping from multiple databases to Hadoop
  • Developed Java programs using Kafka to drive StreamSets pipelines

Computer Vision Model for Covid-19 Diagnosis

  • Built a computer vision model using PyTorch to diagnose Covid-19 from x-ray images
  • Gathered data sets from Kaggle and client resources

Railway Seat Confirmation Probability Improvement

  • Utilized data scraping to analyze and calculate reservation success probabilities
  • Developed an application to suggest optimal ticket booking station pairs

Linux Administration

  • Installed Linux OS on clusters, implemented distributed storage, NAS, and Hadoop
  • Automated tasks using BASH scripts and scheduled them using cron jobs

IoT Projects

  • Provided IoT services, including message updates and kiosk system development
  • Created prototypes for home automation using IoT techniques

Network Administration and Management

  • Planned and allocated IP addresses, configured switches and routers
  • Developed Python scripts to monitor node statuses
  • Provided a web interface for client monitoring

Areas of Expertise

Data Pipeline Development

Design and implementation of scalable data pipelines using modern technologies.

Big Data Processing

Handling and processing large datasets efficiently using distributed systems.

Data Warehousing

Building and optimizing data warehouses for efficient data storage and retrieval.

Featured Projects

Data Pipeline Automation

Developed automated data pipelines for efficient data processing and transformation using modern ETL tools and cloud services.

Tech Stack: Python, Apache Airflow, AWS, SQL

Data Analytics Platform

Built a comprehensive data analytics platform for business intelligence and reporting, enabling data-driven decision making.

Tech Stack: Python, SQL, Tableau, Power BI

Cloud Data Solutions

Implemented cloud-based data solutions for scalable data storage and processing, optimizing performance and cost.

Tech Stack: AWS, Azure, Python, SQL