wikipedia on a circuit board (in progress)
Broker
Posts: 4
Has anyone put Wikipedia on a Propeller PCB? I downloaded the entire Wikipedia site to my laptop (250 GB to start) and have begun making a PCB that contains a Propeller and a 1TB SD card, where I can eventually put the site data.
My plan is to make an intelligent robot that can retrieve Wikipedia data on command, and eventually I would like it to get data by itself, but so far I am having difficulties just starting.
How would I go about retrieving data quickly from the SD card? Would the data storage need to be arranged the way Google or MySQL does it? What do I need to learn?
Help and comments would be appreciated.
Comments
"What do I need to learn?" - Get a Prop with an SD card interface. Or build one, as I did ages ago, using a DIP Prop on a breadboard. Learn to program in Spin. Spin is very easy and should not take too long. Use an SD driver from the Parallax OBEX and do some experiments reading and writing files on the SD card.
It's all good fun but I'm not sure it will get you where you want to go. However, having learned all that you will very likely find other interesting ideas to pursue.
By the way, how does Wikipedia store its data? I guess they have some database system, not just raw text files hanging around. The best way to mirror Wikipedia surely would be to use the same db on a machine (or machines) that can run it.
Learn about database indexing; that would be the number one thing to get right for this to have any chance of success, in my opinion. Plus get a lot of practical experience using a propeller and a few sd cards; that's the other number one thing.
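To make the indexing advice concrete, here is a minimal sketch in Python (not Spin) of one common approach: a sorted file of fixed-width records mapping an article title to a byte offset, searched by binary search so each lookup needs only a handful of seeks. The 32-byte key width, the record layout, and the function names are assumptions for illustration, not anything Wikipedia or Kiwix actually uses.

```python
import io
import struct

KEY_LEN = 32                               # assumed fixed key width in bytes
RECORD = struct.Struct(f"<{KEY_LEN}sQ")    # padded title + 64-bit byte offset


def _key(title):
    """Encode a title as a fixed-width, zero-padded byte key."""
    return title.encode("utf-8")[:KEY_LEN].ljust(KEY_LEN, b"\0")


def build_index(entries):
    """Return sorted fixed-width index bytes from (title, offset) pairs."""
    rows = sorted((_key(title), offset) for title, offset in entries)
    return b"".join(RECORD.pack(key, offset) for key, offset in rows)


def lookup(index_file, title):
    """Binary-search a seekable index file; return the offset or None."""
    key = _key(title)
    index_file.seek(0, io.SEEK_END)
    lo, hi = 0, index_file.tell() // RECORD.size
    while lo < hi:
        mid = (lo + hi) // 2
        index_file.seek(mid * RECORD.size)
        found, offset = RECORD.unpack(index_file.read(RECORD.size))
        if found == key:
            return offset
        if found < key:
            lo = mid + 1
        else:
            hi = mid
    return None
```

With fixed-width records, a lookup over N titles costs about log2(N) seeks, so even tens of millions of titles need only around 25 small reads. That is the property that matters on a slow SD interface; the same idea could be ported to Spin once the SD driver experiments work.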
Once it's working, keeping the data updated with changes might be challenging, though.
This is totally impractical.
However, I do urge Broker to get down with the Prop and an SD card and see what can be done.
I also have a device called a WikiReader that I bought for around $15. It has a proprietary file format, but the software and hardware are both open source, so it wouldn't be too difficult to implement on a Propeller microcontroller system.
Assuming you have bolted the required amount of memory and file system storage to your Propeller to do all that, plus run an OS to support it all, it's still going to take forever.
Perhaps the best way to do this is to write a MIPS emulator for the Prop, which can be used to run Linux, which can then do the rest of the job.
If you have time to wait....
How much does a 1 TB SD card cost?
You said, "contain a propeller and a 1TB SD-card". Well, that made me laugh, as it is just possible to buy 512GB SD cards for around $600 or so, but not 1TB. Then you mentioned a Propeller, which we are all great fans of, but it is not the chip to use for accessing huge amounts of data. And talking about huge amounts of data, what use is it unless the robot can use it in an intelligent manner? Otherwise it is nothing more than a reader where you have to type in what you are looking for. Then you say you are making an intelligent robot that can retrieve Wikipedia data on "command" etc...
I wish you all the best in just getting a 1TB SD card. Once they are available and you have saved up for that then you can proceed to the next step, which BTW, would NOT be a Propeller of any kind. Perhaps a Propeller can run the robot systems, but it is very ill suited to database processing, especially huge databases.
But by all means get a Propeller with an SD card and play, it's fun.
Please be patient.
I still want to place wikipedia on an sd-card and use a propeller to access it.
I've installed the large Wikipedia version using Kiwix. It's fantastic.
Now I need to somehow hack or reverse engineer some code, perhaps to bypass Kiwix.
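A possible first step that needs no reverse engineering: Kiwix stores its content in the openZIM file format, which is publicly documented. The Python sketch below parses the fixed 80-byte header as I understand it from the openZIM specification (little-endian fields, magic number 72173914). Treat the field names and layout as assumptions to check against the spec before relying on them.

```python
import struct

ZIM_MAGIC = 72173914  # 0x044D495A, per the openZIM spec as I understand it

# Assumed header layout: magic, major/minor version, UUID, counts, then
# 64-bit file positions of the pointer lists.
HEADER = struct.Struct("<IHH16sIIQQQQIIQ")
FIELDS = (
    "magic", "major_version", "minor_version", "uuid",
    "entry_count", "cluster_count",
    "url_ptr_pos", "title_ptr_pos", "cluster_ptr_pos", "mime_list_pos",
    "main_page", "layout_page", "checksum_pos",
)


def parse_zim_header(data):
    """Parse the first 80 bytes of a ZIM file into a dict of fields."""
    if len(data) < HEADER.size:
        raise ValueError("need at least 80 bytes of header")
    header = dict(zip(FIELDS, HEADER.unpack_from(data)))
    if header["magic"] != ZIM_MAGIC:
        raise ValueError("not a ZIM file (bad magic number)")
    return header
```

To try it, read the first 80 bytes of a .zim file opened in binary mode and pass them to `parse_zim_header`. The title pointer list it locates is already sorted, so title lookup is a binary search; the expensive part is decompressing the cluster that holds the article.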
btw, kiwix wikipedia is around 61GB which means it is heavily compressed and to decompress on the fly requires both high processing speed and lots of RAM. This is not a job for a microcontroller like the Propeller.
Now you need to somehow reverse engineer your own claims, to better understand why what you would like to do can't be done the way you would like to do it. In other words, examine your motives and find some more immediate and realistic goals that you can set for yourself, and work from there.
Most likely so. I have seen 128GB on the store shelf, 512GB, and 1TB (1024GB) on the internet. The 1TB cards were listed for $39.99, in "no name" packaging, which makes me somewhat suspicious.
On top of that, I think trying to store the entire wiki on a single card and accessing the data via a single Propeller or other CPU would be dreadfully slow. Better to use multiple cards and CPUs/cogs, and access the cards in parallel.
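One way the multiple-card idea could be organised is to hash each article title to pick the card it lives on, so every reader finds the right card without a central directory. This is a sketch in Python under that assumption; the function names and the choice of CRC32 are illustrative only, and nothing here is Propeller-specific.

```python
import zlib


def card_for_title(title, num_cards):
    """Pick a card number (0..num_cards-1) from a stable hash of the title."""
    return zlib.crc32(title.encode("utf-8")) % num_cards


def partition_titles(titles, num_cards):
    """Group titles by the card each one would be stored on."""
    cards = [[] for _ in range(num_cards)]
    for title in titles:
        cards[card_for_title(title, num_cards)].append(title)
    return cards
```

Because the hash depends only on the title, a lookup touches exactly one card, and lookups for different titles can proceed on different cards (and cogs) at the same time. A full-text search, by contrast, would still have to scan every card.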
You can find them on eBay and Amazon. Read their reviews.
One on eBay advertises 90MB/sec data transfer.
At that speed it would take about 3 hours to fill the 1TB. The 131 DAYS figure only works out at roughly 90KB/sec...
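A quick check of the fill-time arithmetic, assuming decimal units (1 TB = 10^12 bytes, 1 MB = 10^6 bytes) and a sustained transfer at the advertised rate, which real cards rarely manage:

```python
def fill_time_seconds(capacity_bytes, rate_bytes_per_s):
    """Time to write the whole card at a constant rate."""
    return capacity_bytes / rate_bytes_per_s


TB = 10**12
MB = 10**6
SECONDS_PER_DAY = 86_400

# Fill time at the advertised 90 MB/s, in hours.
hours_at_90mb = fill_time_seconds(TB, 90 * MB) / 3600

# Transfer rate that a 131-day fill time would imply, in bytes per second.
rate_for_131_days = TB / (131 * SECONDS_PER_DAY)

print(f"{hours_at_90mb:.1f} hours at 90 MB/s")
print(f"{rate_for_131_days / 1000:.0f} KB/s implied by 131 days")
```

So 90 MB/s fills 1 TB in roughly three hours, while a 131-day fill corresponds to about 88 KB/s, which is closer to what a slow SPI-mode SD driver on a microcontroller might achieve than to the card's advertised speed.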
Yep, which is why I am somewhat suspicious of them. If I were to attempt such a large data storage and retrieval task using propellers (highly unlikely) it would be with multiple smaller cards and at least one cog per card for reading and one for searching.
-Phil
https://akhenakh.github.io/gozim/