Learning Hub


Top 8 Cloud Data Warehousing Technologies

At a time when information and insights from data are the most significant assets any business has, implementing data warehouse solutions is more critical than ever before. So, what exactly is a data warehouse? In a nutshell, a data warehouse is a central repository of data that supports data analysis and acts as a channel across different sets of analytical tools and data stores. At their core, data warehousing solutions incorporate versatile features that cater to varied scopes of data analysis, management, and consolidation. Not just that, you can even extrapolate crucial business data points to ensure consistency across your analytics platforms. Modern data warehouse solutions now even come with built-in AI and ML algorithms that can tremendously help you in making key business decisions. As of today, almost all the major data warehouse solutions are delivered through the cloud, with the flexibility to add or remove features and scale up or down within seconds with just a click of a mouse. Now let's look at which cloud data warehousing service providers are the best today.

Teradata Integrated Data Warehouse

Teradata has been a market leader in data warehousing and data management for more than 35 years. The Teradata data warehouse is built on impressive database technology and has served many of the world's leading organizations. Teradata offers a 360-degree understanding of, and insights into, data that can be pulled together from a range of sources, and Teradata QueryGrid provides actionable insights into big data. In addition, you can deploy Teradata on IntelliCloud (also provided by Teradata), on-premises, or in a public, private, or hybrid cloud setup.

SAP Data Warehouse Cloud

SAP is very popular in the world of data analysis, data development, and business analytics. SAP Data Warehouse Cloud is ideal for organizations that need to make more insightful business choices. This enterprise-ready data warehouse merges all possible data sources into one single environment, helping you get more insights from your data to support crucial business decisions. SAP's data warehousing semantic layer makes analytics easier for clients with persona-driven, insightful data, and pre-built adapters provide instant access to application data. Even better, SAP's data warehousing is versatile, adaptable, elastic, and open, making it a good choice for organizations of all sizes.

Oracle Autonomous Data Warehouse

The Oracle Autonomous Data Warehouse offers organizations an easy-to-use and accessible framework that scales with user activity. It was designed to provide super-fast, reliable, and elastic performance with minimal to zero administration. Oracle is a great product for novices and beginners who are weighing the pros and cons of data warehouses in the cloud. It is a strong choice for an end-to-end, fully managed, and reliable cloud service that makes using and implementing cloud services a walk in the park. Furthermore, Oracle's data warehouse is exceptionally flexible and highly elastic, allowing organizations to expand their compute capacity as their requirements change. You pay only for what you use, and everything integrates seamlessly with a range of business analytics and IoT tools.
Microsoft Azure Synapse

Microsoft Azure Synapse evolved from Microsoft's Azure SQL Data Warehouse offering. Synapse is the most advanced enterprise data warehousing solution Microsoft has come up with to date. With Microsoft's cloud data warehouse offering, you can easily query data according to individual requirements, with the flexibility to use both provisioned and serverless on-demand resources. Synapse also empowers you to leverage the power of AI, ML, and business intelligence as part of a combined business intelligence ecosystem. Additionally, Microsoft provides advanced privacy and security features across its data warehousing solutions.

IBM Db2 Warehouse

IBM Db2 Warehouse is a strong relational database solution that delivers high performance and high-quality analytics to its customers. Db2 integrates seamlessly with IBM's in-memory columnar database technology, an enormous advantage for organizations requiring a high-performance database solution. Users can quickly initiate a cloud deployment on the IBM Cloud, and a traditional on-premises version of the Db2 data warehouse is also available.

Google BigQuery

Google BigQuery is a major component of the Google Cloud ecosystem. This exceptionally adaptable, serverless cloud data warehouse solution is ideal for organizations that need to minimize expenses while still benefiting from the power of cloud computing. If you need to make quick, crucial business decisions using data analytics, BigQuery has you covered. BigQuery sets itself apart through its availability and accessibility. Moreover, you can run your analytics environments with a three-year TCO that is up to 34% less expensive than other cloud offerings. Integration with Google's AI and ML tools is another key differentiator if you are keen on venturing into the world of AI/ML.

Snowflake

Snowflake is a very popular data warehousing solution that offers an assortment of public cloud options. With Snowflake, you can make your business more data-driven, empowering you to create stunning user experiences. Snowflake's convenient and flexible pricing model helps you save on costs and pay only for the resources and services you use. Snowflake's robust data warehouse architecture improves data flow while reducing unnecessary complexity in your data model. You also get self-service access to all the additional functionality you need.

Amazon Redshift

Amazon Redshift is undoubtedly one of the most well-known data warehouse solutions on the market today. The service drives the analytical initiatives of startups and Fortune 500 organizations alike; some of the biggest brands using Redshift today include Intuit, Lyft, Yelp, and even McDonald's. Probably the best thing about Redshift is that it integrates seamlessly with your data lake and the wider AWS environment. Redshift allows technical users and business users to query and analyze immense amounts of unstructured, semi-structured, and fully structured data from a host of sources.


Learnings From Creating an IoT Data Pipeline on AWS

Handling IoT devices and running computations on their readings is always a tedious process, as it requires both hardware and software expertise. It becomes even more complex when you need to transfer data using traditional MQTT protocols and design your own servers and infrastructure to handle the flow of data from IoT devices to your software platform. But what if you could leave the entire infrastructure and data flow to someone else, initialize the operations yourself, and then sit back and relax? Does this sound interesting? Obviously, yes! The AWS cloud platform provides a wide variety of services that let you set up the entire data flow of your IoT devices in the cloud, with security and data backup managed by AWS itself.

AWS provides different ways to set up IoT data streaming in your software. One of them is explained below:

EDGE → API GATEWAY → KINESIS DATA STREAMS → FIREHOSE → S3 → ATHENA

AWS Kinesis Data Streams: For streaming real-time sensor data into your dashboards.
AWS Cognito: For validating the source of the data.
AWS API Gateway: A serverless API for injecting data into Kinesis Data Streams. This is the endpoint the IoT edge client uses to insert data into your data pipeline.
AWS Kinesis Firehose: For dumping the data obtained from Kinesis Data Streams into S3.
AWS Athena: For performing aggregations and analysis on the real-time data stored in S3.

Challenges for the Developer

Although this pipeline seems simple and straightforward, there are certain areas where the developer may be challenged and will have to put in extra effort to set up the flow.

Sending Data from API Gateway to Kinesis Data Streams

Challenge: Kinesis Data Streams is designed to send a blob of data to Firehose, which in turn sends this data on to its destination. But what if there is an array of records that needs to be sent in each iteration? Kinesis can easily send a single record, but with multiple records you may run into serialization errors.

Solution: Use the right Kinesis action while sending data through API Gateway: PutRecord for sending a single record and PutRecords for sending multiple records. However, you need to be careful while designing the message (mapping) templates in API Gateway for both methods; see the Kinesis sketch below.

Dumping Data to the Kinesis Firehose Destination in the Correct Format

Challenge: Kinesis Firehose offers different formats that a developer can use while creating data dumps in S3. The available formats are CSV, JSON, Parquet, and ORC. Initially you might wonder how a data format could pose a challenge, but once the data size grows exponentially over time, it becomes one: CSV and JSON produce very bulky data sets in the long run.

Solution: The right data format is Parquet. The other formats store data in a row format, while Parquet stores it in a columnar format, which makes query execution from Athena faster. Parquet also strips unnecessary space and blank fields while storing, which saves S3 space as well; see the Firehose configuration sketch below.
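To make the PutRecord/PutRecords distinction concrete, here is a minimal sketch of the two calls made with boto3 directly against Kinesis Data Streams. The stream name, partition key, and payload shape are illustrative assumptions; in the pipeline described above the same Kinesis actions are invoked through API Gateway mapping templates rather than from Python.

```python
import json
import boto3

kinesis = boto3.client("kinesis", region_name="us-east-1")
STREAM_NAME = "iot-sensor-stream"  # hypothetical stream name


def send_single_reading(reading: dict) -> dict:
    """A single record goes through PutRecord."""
    return kinesis.put_record(
        StreamName=STREAM_NAME,
        Data=json.dumps(reading).encode("utf-8"),
        PartitionKey=str(reading["device_id"]),
    )


def send_reading_batch(readings: list) -> dict:
    """An array of records goes through PutRecords, one entry per record."""
    return kinesis.put_records(
        StreamName=STREAM_NAME,
        Records=[
            {
                "Data": json.dumps(r).encode("utf-8"),
                "PartitionKey": str(r["device_id"]),
            }
            for r in readings
        ],
    )


if __name__ == "__main__":
    send_single_reading({"device_id": 1, "temperature": 27.4})
    send_reading_batch([
        {"device_id": 1, "temperature": 27.4},
        {"device_id": 2, "temperature": 31.2},
    ])
```

The API Gateway mapping templates mirror this difference: the PutRecords template has to wrap each element of the incoming array in its own Data/PartitionKey entry, which is where the serialization errors mentioned above usually come from.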
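Along the same lines, here is a rough sketch of creating the Firehose delivery stream with record format conversion to Parquet using boto3. All names and ARNs are placeholders, and format conversion assumes an existing Glue table that describes the record schema; this is one way to wire it up, not necessarily how the original pipeline was configured.

```python
import boto3

firehose = boto3.client("firehose", region_name="us-east-1")

# Hypothetical stream, bucket, role, and Glue names; replace with your own.
firehose.create_delivery_stream(
    DeliveryStreamName="iot-sensor-firehose",
    DeliveryStreamType="KinesisStreamAsSource",
    KinesisStreamSourceConfiguration={
        "KinesisStreamARN": "arn:aws:kinesis:us-east-1:123456789012:stream/iot-sensor-stream",
        "RoleARN": "arn:aws:iam::123456789012:role/firehose-read-kinesis",
    },
    ExtendedS3DestinationConfiguration={
        "BucketARN": "arn:aws:s3:::iot-sensor-data-lake",
        "RoleARN": "arn:aws:iam::123456789012:role/firehose-write-s3",
        "Prefix": "sensor-data/",
        # Format conversion requires a buffer size of at least 64 MB.
        "BufferingHints": {"SizeInMBs": 64, "IntervalInSeconds": 300},
        # Convert incoming JSON records to Parquet before writing to S3.
        "DataFormatConversionConfiguration": {
            "Enabled": True,
            "InputFormatConfiguration": {"Deserializer": {"OpenXJsonSerDe": {}}},
            "OutputFormatConfiguration": {"Serializer": {"ParquetSerDe": {}}},
            "SchemaConfiguration": {
                "DatabaseName": "iot_db",        # existing Glue database (assumed)
                "TableName": "sensor_readings",  # existing Glue table with the record schema
                "RoleARN": "arn:aws:iam::123456789012:role/firehose-read-glue",
                "Region": "us-east-1",
            },
        },
    },
)
```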
Querying Partitioned Data from S3 in Athena

Challenge: Firehose offers a special reward: it stores data in S3 in a partitioned format, by date. This is a big gift when you are querying large S3 data sets from Athena, because each time you query S3 data from Athena, Athena scans a chunk of the data in S3. Two cases can arise:

If the data is not partitioned: Athena will scan the entire chunk of data present in the S3 bucket. If you have 10 years of records in the bucket and you query just the latest record, it will still scan all 10 years of data, resulting in a lot of GET requests, which in turn drives up S3 costs significantly.

If the data is partitioned: Athena will scan only the latest day's records even if you have 10 years of data in your S3 bucket, and your GET requests won't increase.

The challenge here is: how do you read partitioned data in Athena?

Solution: Use the concept of Partition Projection while reading partitioned data in S3. You can implement Partition Projection while creating the Glue tables for Firehose. Partition Projection lets Athena read data for a particular timestamp value only, so you end up scanning just the set of records that the query actually requires (a minimal table-definition sketch follows the case study below).

Project Case Study

We recently created an IoT-based product for a power sector client. The product focuses on receiving data from IoT sensors installed at an edge location and then displaying this information in a web-based application. Data from the IoT sensors travels through the cloud (AWS) and is encrypted so that data integrity and security are ensured. The backend infrastructure is designed using Amazon Web Services (AWS) and the Django REST Framework. We used Kinesis for real-time streaming on our dashboard and Athena for displaying aggregated results for time-based filters. While setting up the infrastructure, we faced some bottlenecks in the AWS Kinesis data pipeline, such as reading long JSON arrays in Kinesis and querying the partitioned data stored in the S3 bucket. But with proper research work and collaborative engineering we achieved our goal and completed this beautiful product. Every new recipe may not be perfect on the first go, but if we take lessons from other chefs' experiences, we may well end up making a yummy dish!
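As referenced in the partition-projection solution above, here is a minimal sketch of what such a Glue table definition might look like when created with boto3, using Athena partition projection over the date-based key prefixes that Firehose writes. The database, table, bucket, columns, and date range are assumptions to adapt to your own pipeline.

```python
import boto3

glue = boto3.client("glue", region_name="us-east-1")

glue.create_table(
    DatabaseName="iot_db",
    TableInput={
        "Name": "sensor_readings_partitioned",
        "TableType": "EXTERNAL_TABLE",
        "StorageDescriptor": {
            "Columns": [
                {"Name": "device_id", "Type": "bigint"},
                {"Name": "temperature", "Type": "double"},
                {"Name": "recorded_at", "Type": "timestamp"},
            ],
            "Location": "s3://iot-sensor-data-lake/sensor-data/",
            "InputFormat": "org.apache.hadoop.hive.ql.io.parquet.MapredParquetInputFormat",
            "OutputFormat": "org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat",
            "SerdeInfo": {
                "SerializationLibrary": "org.apache.hadoop.hive.ql.io.parquet.serde.ParquetHiveSerDe"
            },
        },
        # One partition column mapped onto Firehose's yyyy/MM/dd/HH key prefix.
        "PartitionKeys": [{"Name": "dt", "Type": "string"}],
        "Parameters": {
            # Athena partition projection: partition values are computed from these
            # rules, so a query filtered on dt scans only the matching S3 prefixes.
            "projection.enabled": "true",
            "projection.dt.type": "date",
            "projection.dt.format": "yyyy/MM/dd/HH",
            "projection.dt.range": "2022/01/01/00,NOW",
            "projection.dt.interval": "1",
            "projection.dt.interval.unit": "HOURS",
            "storage.location.template": "s3://iot-sensor-data-lake/sensor-data/${dt}/",
        },
    },
)
```

With a table like this, a query such as SELECT * FROM sensor_readings_partitioned WHERE dt = '2022/06/01/10' scans only that hour's objects instead of the whole bucket, which is exactly the GET-request saving described above.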
