abhijeets7663@gmail.com
help@ccatcracker.in

DBDA Course Details

DBDA

Linux Programming and Cloud Computing 80 Hours

Python Programming and Advanced Analytics 90 Hours

Object Oriented Programming with Java 880 Hours

Statistical Analysis with R 70 Hours

Data Collection & DBMS (Principles,Tools & Platforms) 70 Hours

Cloud Computing & HPC Applications 40 Hours

Big Data Technologies 140 Hours

Data Visualization - Analysis and Reporting 50 Hours

Practical Machine Learning 100 Hours

Aptitude and General English 50 Hours

Effective Communication 50 Hours

Project 120 Hours


Fees

The total fees of the course is Rs. 90,000/- plus Goods and Service Tax (GST) currently 18%.

The course fees has to be paid in two installment as per the schedule.

First installment is Rs. 10,000/- plus Goods and Service Tax (GST) currently 18%.

Second installment is Rs. 80,000/- plus Goods and Service Tax (GST) currently 18%.

Placement

Overall placement for Big data course is not like DAC .

As big data is very large technology and Practically companies
will not give preference to fresher for big data AI.

But you will get good calls because of CDAC so it depends on your performance .

Average placement for DBDA top college is above 70% .

No of interview calls are depends on your ccee exam score.

CCEE

Top 5 Colleges For Placement :

1. CDAC ACTS Pune

2. Sunbeam Pune

3. IACSD Akurdi

4. VITA Mumbai

5. Knowledge Park Banglore

Job Profile In DAC

Java Developer

Python developer

Big data developer

Software Developer

Testing Engineer


Post Graduate Diploma in Big Data Analytics (PG-DBDA) has the following focus:

1. To explore the fundamental concepts of big data analytics
2. To develop in-depth knowledge and understanding of the big data analytic domain.
3. To learn to analyze the big data using intelligent techniques.
4. To understand the various search methods and visualization techniques.
5. To learn to use various techniques for mining data stream.
6. To understand the applications using Map Reduce Concepts
7. To learn Cloud Computing, accessing resources and services needed to perform functions with dynamically changing needs.

Best Center for DBDA

1. CDAC ACTS Pune

2. Sunbeam Pune

3. IACSD Akurdi

4. VITA Mumbai

5. Knowledge Park Banglore

After Course Complete :

After completing this courses students shall be expert in following things:

Big Data tools, Big Data Querying Tools, such as Pig, Hive, and Impala, park

Integration of data from multiple data sources

NoSQL databases, such as Hbase, MongoDB

Knowledge of various ETL techniques and Implementing ETL Process Monitoring performance and advising any necessary infrastructure changes

Developing reports, dashboards using D3 & Tableau

Big Data Development

Business Analyst

DBDA Eligibility :

Graduate in Engineering (10+2+4 or 10+3+3 years) in IT / Computer Science / Electronics / Telecommunications / Electrical / Instrumentation. OR MSc/MS (10+2+3+2 years) in Computer Science, IT, Electronics.

Mathematics in 10+2 (exempted for candidates with Diploma + Engineering) OR

Post Graduate Degree in Engineering Sciences with corresponding basic degree (e.g. MSc in Computer Science, IT, Electronics) OR

4-year Graduation in Bioinformatics, OR

Post Graduate Degree in Mathematics / Statistics / Physics / MBA Systems, OR

MCA

Note: The candidates must have secured a minimum of 55% marks in their qualifying examination

Detail Syllabus :

Linux & Cloud Programming :

Linux History and Operation, Installing and Configuring Linux, Shells, Commands, and Navigation, Common Text Editors, Administering Linux, Introduction to Users and Groups, Linux shell scripting

Introduction to Cloud Computing: Definition, Characteristics, Components, Cloud provider, SAAS, PAAS, IAAS and other Organizational scenarios of clouds, Administering & Monitoring cloud services, benefits and limitations, Deploy application over cloud. Comparison among SAAS, PAAS, IAAS, Cloud computing platforms: Infrastructure as service: Amazon EC2, Platform as Service: Google App Engine, Microsoft Azure Utility Computing, Elastic Computing, SLA, clusters, cloud analytics, challenges of cloud environment, HPC in the cloud

Parallel Processing Concepts: Physical Organization and building blocks of High Performance Computing Systems, Processors and Multi-Core Architectures, Vector processing, Super-scalar, In-order execution, Instruction-Level Parallelism etc., FMA, 32 and 64 bit types, ISA, Accelerators such as GPGPUs and Xeon Phi. Threads and Processes, Multi-processing OS, Parallel I/O, General concepts

Parallel Programming Models and Parallel Algorithms Design: Application domains of HPC, Decomposition Techniques: Data parallelism, Functional parallelism, Divide and Conquer etc., Characteristics of Tasks and Interactions, Mapping Techniques for Load Balancing, Methods for Containing Interaction Overheads, Granularity of parallelism, Programming OpenMP

Python Programming and Advanced Analytics :

Introduction to Python, Basic Syntax, Data Types, Variables, Operators, Input/output, Flow of Control (Modules, Branching), If, If- else, Nested if-else, Looping, For, While, Nested loops, Control Structure, Break, Continue, Pass, Strings and Tuples, Accessing Strings, Basic Operations, String slices, Working with Lists, Introduction, Accessing list, Operations, Function and Methods, Files, Modules, Dictionaries, Functions and Functional Programming, Declaring and calling Functions, Declare, assign and retrieve values from Lists, Introducing Tuples, Accessing tuples, Visualising using matplotlib, seaborn

Advanced Python: Object Oriented, OOPs concept, Class and object, Attributes, Inheritance, Overloading, Overriding, Data hiding, Operations Exception, Exception Handling, Except clause, Try-finally clause, User Defined Exceptions, Data wrangling, Data cleaning

Python Libraries pandas, numpy, scipy, scrapy, plotly, selenium, beautiful soup

Advanced Analytics: Introduction to Business Analytics using some case studies, Making Right Business Decisions based on data, Exploratory Data Analysis - Visualization and Exploring Data, Descriptive Statistical Measures, Probability Distribution and Data, Sampling and Estimation, Statistical Interfaces, Predictive modeling and analysis, Regression Analysis, Forecasting Techniques, Simulation and Risk Analysis, Optimization, Linear, Non linear, Integer, Decision Analysis, Strategy and Analytics, Text analytics, NLP, Social network analysis, web scrapping, Dimensionality issues, Ridge & lasso regression, bias/variance trade off, density, PCA, FA, feature selection, Bagging and boosting, Simulation : Monte carlo

Overview of Factor Analysis, Directional Data Analytics, Functional Data Analysis


Object Oriented Programming with Java 8

Oops Concepts, Data Types, Operators and Language, Constructs, Inner Classes and Inheritance, Interface and Package, Exceptions, Collections, Threads, Java.lang, Java.util, Java.awt, Java.io , Java Persistent, Servlets, Java Virtual Machine, Lambda Expressions, Introduction of JDBC API


Statistical Analysis with R :

Probability & Statistics: Introduction to Statistics- Descriptive Statistics, Summary Statistics Basic probability theory, Statistical Concepts (uni-variate and bi-variate sampling, distributions, re-sampling, statistical Inference, prediction error), Probability Distribution (Continuous and discrete- Normal, Bernoulli, Binomial, Negative Binomial, Geometric and Poisson distribution) , Bayes’ Theorem, Central Limit theorem, Data Exploration & preparation, Concepts of Correlation, Regression, Covariance, Outliers etc.

R Programming: Introduction & Installation of R, R Basics, Finding Help, Code Editors for R, Command Packages, Manipulating and Processing Data in R, Reading and Getting Data into R, Exporting Data from R, Data Objects-Data Types & Data Structure. Viewing Named Objects, Structure of Data Items, Manipulating and Processing Data in R (Creating, Accessing , Sorting data frames, Extracting, Combining, Merging, reshaping data frames), Control Structures, Functions in R (numeric, character, statistical), working with objects, Viewing Objects within Objects, Constructing Data Objects, Building R Packages, Running and Manipulating Packages, Non parametric Tests- ANOVA, chi-Square, t-Test, U-Test,Interactive reporting with R markdown, Introduction to Rshiny


Data Collection & DBMS (Principles,Tools & Platforms)

Database Concepts (File System and DBMS), Database Storage Structures (Tablespace, Control files, Data files), Structured and Unstructured data, SQL Commands (DDL, DML & DCL), data collection. the tools And how data can be gathered in a systematic fashion, Dataware Housing concept , No-SQL, Data Models - XML, working with MongoDB), Cassendra- overview, architecture, comparison with MongoDB, graph databases, in-memory databases

Tools - OLTP and OLAP, data preparation and cleaning techniques

Cloud Computing & HPC Applications

Introduction to Cloud Computing: Definition, Characteristics, Components, Cloud provider, SAAS, PAAS, IAAS and other Organizational scenarios of clouds, Administering & Monitoring cloud services, benefits and limitations, Deploy application over cloud. Comparison among SAAS, PAAS, IAAS, Cloud computing platforms: Infrastructure as service: Amazon EC2, Platform as Service: Google App Engine, Microsoft Azure Utility Computing, Elastic Computing, SLA, clusters, cloud analytics, challenges of cloud environment, HPC in the cloud

Parallel Processing Concepts: Physical Organization and building blocks of High Performance Computing Systems, Processors and Multi-Core Architectures, Vector processing, Super-scalar, In-order execution, Instruction-Level Parallelism etc., FMA, 32 and 64 bit types, ISA, Accelerators such as GPGPUs and Xeon Phi. Threads and Processes, Multi-processing OS, Parallel I/O, General concepts

Parallel Programming Models and Parallel Algorithms Design: Application domains of HPC, Decomposition Techniques: Data parallelism, Functional parallelism, Divide and Conquer etc., Characteristics of Tasks and Interactions, Mapping Techniques for Load Balancing, Methods for Containing Interaction Overheads, Granularity of parallelism, Programming OpenMP


Big Data Technologies

Introduction to Big Data- Big data definition, enterprise / structured data, social / unstructured data, unstructured data needs for analytics, What is Big Data, Big Deal about Big Data, Big Data Sources, Industries using Big Data, Big Data challenges.

Hadoop: Introduction of Big data programming-Hadoop, History of Hadoop, The ecosystem and stack, The Hadoop Distributed File System (HDFS), Components of Hadoop, Design of HDFS, Java interfaces to HDFS, Architecture overview, Development Environment, Hadoop distribution and basic commands, Eclipse development, The HDFS command line and web interfaces, The HDFS Java API (lab), Analyzing the Data with Hadoop, Scaling Out, Hadoop event stream processing, complex event processing, MapReduce Introduction, Developing a Map Reduce Application, How Map Reduce Works, The MapReduce Anatomy of a Map Reduce Job run, Failures, Job Scheduling, Shuffle and Sort, Task execution, Map Reduce Types and Formats, Map Reduce Features, Real-World MapReduce,

Hadoop ETL: Hadoop ETL Development, ETL Process in Hadoop, Discussion of ETL functions, Data Extractions, Need of ETL tools, Advantages of ETL tools.

Introduction to HIVE : Programming with Hive: Data warehouse system for Hadoop, Optimizing with Combiners and Partitioners (lab), Bucketing, More common algorithms: sorting, indexing and searching (lab), Relational manipulation: map-side and reduce-side joins (lab), evolution, purpose and use

Map Reduce: Overview and concepts, interface to HDFS (HTTP, CLI and Java API), high availability and Name Node federation, Map Reduce developing and deploying programs, optimization techniques, Map Reduce Anatomy, Data flow framework programming Map Reduce best practices and debugging, Introduction to Hadoop ecosystem, integration R with Hadoop

HBase: Overview, comparison and architecture, java client API, CRUD operations and security

Programming Pig: Engine for executing data flows in parallel on Hadoop: Overview, comparison and architecture, Latin scripting and statements, data types, UDF’s, built in functions and use cases

Hadoop Environment: Setting up a Hadoop Cluster, Cluster specification, Cluster Setup and Installation, Hadoop Configuration, Security in Hadoop, Administering Hadoop, HDFS – Monitoring & Maintenance, Hadoop benchmarks, Hadoop in the cloud.

Introduction to Apache Spark and Use Cases :
Apache Spark APIs for large-scale data processing: Overview, Linking with Spark, Initializing Spark, Resilient Distributed Datasets (RDDs), External Datasets, RDD Operations, Passing Functions to Spark, Working with Key-Value Pairs, Shuffle operations, RDD Persistence, Removing Data, Shared Variables, Deploying to a Cluster Spark Streaming, Spark MLlib and ML APIs, Spark Data Frames/Spark SQL, Integration of Spark and Kafka, Mapreduce, mongodb with spark


Data Visualization - Analysis and Reporting

Business Intelligence- requirements, content and managements, information Visualization, Data analytics Life Cycle, Analytic Processes and Tools, Analysis vs. Reporting, Modern Data Analytic Tools, Visualization Techniques, Visual Encodings, Visualization algorithms, Data collection and binding, Cognitive issues, Interactive visualization, Visualizing big data – structured vs unstructured, Visual Analytics, Geomapping, Dashboard Design


Practical Machine Learning

Supervised and Unsupervised Learning , Uses of Machine learning , Clustering, K means, Hierarchical Clustering, Decision Trees, Classification problems, Bayesian analysis and Naïve bayes classifier, Random forest, Gradient boosting Machines, Association rules learning, Apriori and FP-growth algorithms, Support vector Machines, Linear and Non liner classification, ARIMA, Neural Networks and its application, Neural Net & its applications, deep learning algorithms, KNN, NLP, NLTK, ML modeling using Scikit learn, AI and its application


How to Prepare for DBDA interview :

DBDA Interview Preparation Course Free :

Interview Preparation




About Us

Hey guys this is the website I have developed to all those students wants to success in cdac from start to end here you will find everything regarding CDAC.


Test Series

This online test series i have designed for ccat exam you will feel like you are giving ccat exam after taking this test you will feel confident for exam. I recommend you guys at leaset give 2,3 test before exam my quetions are very relevent to ccat.


Contact Us

This is the website only to clear doubts of all the students so feel free to contact us at any time for anything regarding cdac.