Data analysis project
$250-750 USD
Paid on delivery
I need data analysis project by python.
Let's assume that we would like to collect labels for each column as Organization, Person, Address and Other from 250,000 different datasets. For instance, different column names such as vendor_name, business, name, corporation and parent_company can be used to represent Organization and it becomes difficult to label each column manually when you have a large number of datasets. Explain your ideas and methods to efficiently obtain labels in as much detail as possible. After that I will award you.
Project ID: #21229973
About the project
26 freelancers are bidding on average $523 for this job
Hello, I have gone through your job posting and become very much interested to work with you. I am an expert in this field. I have already completed several projects like this. For evidence you can see my profile. Pl More
I am a Data Scientist with 3+ years of experience in Data Analysis, Statistical Modelling, Machine Learning, Deep Learning, Computer Vision and Natural Language Processing. I have worked across various domains such as More
Dear sir. Your project attracted my attention at first glance, because I've extensive experience in Data Analysis Programming. I'm really confident about your project, and very eager to join your project. If we have a More
HI, I am data scientist and have good experience in python and R programming. My area of interest is statistical Analysis of dataset and apply ML/deep learning algorithm. I can intern your tasks. Kind Regards
Hi, I can help you get this done. I have skills in Python, Data Processing, Machine Learning (ML), Data Mining, Statistical Analysis
Hi, First of all, your explanation was not very clear to me. Do we need to categorize the column label or something else? I am not able to understand your explanation completely. Please share more detail as I here More
Hello sir. I'm excited about your project, because I've really rich experience in Data Analysis Programming. I've developed many projects similar to yours and excellent skills. If you award me, I'll provide wonderfu More
hi, I'm a professional statistical analyst seeking opportunity to provide highest quality services in the following areas of Statistics and Econometric. Looking for outstanding opportunities to apply my academic creden More
Hello, I've read your project requirements thoroughly. The most possible solution for your problem would be to collect all the keywords (column names) through a program then filter them out for unique values. After th More
It is a job access can do it. Just simple filtration of Variable, if your data is stored. I can do your project efficiently
Hello, For what I have understand is you need to automate the process of labeling the columns from around 250000 datasets into the finite columns like organization , etc. I would propose to create a dictionary for ever More
My approach would be to: 1. Collect all column names from the different data sources 2. Apply basic text processing steps and regular expression methods like removal of special characters and stopwords, treatment of st More
My preferred method of freelancing is an interactive approach to project solving. I have an MSEE specializing in Digital Signal/Image/RF Processing. I do my work in MATLAB (expert). I also do Python programming.
Data cleaning & extraction can be done various pattern matching and regrex based on dataset given. Custom algorithm to get more effiecient extraction based on given input All models will be coded in python, so all ma More
That's Simple , NER - Named Entity Recognition with SpaCy or NLTK , will do your task . Process will involve from reading column names to categorizing - with tokenizing , chunking , etc . to your datasets , actually wh More
0/ Make data mining based on your datasets 1/ Create a list of words from all datasets using bag of words 2/ display those words on the screen 3/ use the list of stop words and create rules that will be used for separ More