There are three steps to follow:
1. Data ingestion - This is the extraction of data from various sources. A source may be a CRM such as Salesforce, an ERP system, etc.
2. Data storage - The ingested data can be stored either in HDFS, which works well for sequential access, or in a NoSQL database, which is better suited to random reads and writes.
3. Data processing - The data is processed with one of the processing frameworks, such as Spark, MapReduce, or Pig.
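
The three steps above can be sketched end to end in plain Python. This is only a toy illustration, not how a real cluster is wired up: the CSV text stands in for a CRM export, a local JSON Lines file stands in for HDFS, and a simple group-and-sum stands in for a Spark or MapReduce job. The names CRM_EXPORT, ingest, store, process and run_pipeline are all made up for this example.

```python
import csv
import io
import json
import tempfile
from collections import defaultdict
from pathlib import Path

# Hypothetical CRM export; a real source would be Salesforce, an ERP, etc.
CRM_EXPORT = """account,region,amount
Acme,EU,120.0
Globex,US,80.5
Initech,EU,30.0
"""


def ingest(raw_csv):
    """Step 1: extract records from a source (here, an in-memory CSV)."""
    return list(csv.DictReader(io.StringIO(raw_csv)))


def store(records, path):
    """Step 2: write records sequentially (a local file standing in for HDFS)."""
    with path.open("w", encoding="utf-8") as f:
        for record in records:
            f.write(json.dumps(record) + "\n")


def process(path):
    """Step 3: map each record to (region, amount), then reduce by key."""
    totals = defaultdict(float)
    with path.open(encoding="utf-8") as f:
        for line in f:
            record = json.loads(line)
            totals[record["region"]] += float(record["amount"])
    return dict(totals)


def run_pipeline():
    """Run ingestion, storage and processing in order on the sample data."""
    with tempfile.TemporaryDirectory() as tmp:
        path = Path(tmp) / "sales.jsonl"
        store(ingest(CRM_EXPORT), path)
        return process(path)


if __name__ == "__main__":
    print(run_pipeline())
```

In a real deployment each function would be replaced by the corresponding tool: an ingestion tool for step 1, HDFS or a NoSQL store for step 2, and a framework such as Spark for step 3. The order of the steps stays the same.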